1 

A 



2 32 

CGA GTA AAT CGT TCT CGA AGC CTT AGT AAT AGC AAT CCA GAT ATA TCT GGG ACT CCC ACG 
arg val asn arg ser arg ser leu ser asn ser asn pro asp ile ser gly thr pro thr

62 92 

TCA CCA GAT GAT GAA GTT CGA TCA ATC ATC GGG AGT AAG GGT TTA GAT CGC TCC AAT TCC 

ser pro asp asp glu val arg ser ile ile gly ser lys gly leu asp arg ser asn ser
122 152 

TGG GTT AAC ACT GGT GGT CCA AAA GCT GCC CCA TGG GGA TCC AAC CCC AGT CCA AGT GCA

trp val asn thr gly gly pro lys ala ala pro trp gly ser asn pro ser pro ser ala 
182 212 

GAA TCA ACA CAG GCT ATG GAT CGA AGT TGT AAT CGT ATG TCT TCG CAC ACA GAG ACG TCA 
glu ser thr gln ala met asp arg ser cys asn arg met ser ser his thr glu thr ser

242 272

AGT TTC TTA CAA ACA TTA ACG GGA CGC TTA CCA ACT AAA AAG CTT TTT CAC GAG GAG CTG

ser phe leu gln thr leu thr gly arg leu pro thr lys lys leu phe his glu glu leu

* T r *j 

302 332

GCT TTG CAG TGG GTT GTT TGC AGT GGC AGC GTT CGG GAA TCA GCT TTG CAA CAA GCC TGG
ala leu gln trp val val cys ser gly ser val arg glu ser ala leu gln gln ala trp

362 392

TTC TTT TTT GAA TTA ATG GTA AAG AGC ATG GTG CAC CAT TTA TAC TTT AAT GAT AAA CTT
phe phe phe glu leu met val lys ser met val his his leu tyr phe asn asp lys leu



Cadherin

422 | xxx cleavage xx | 452

GAG GCT CCA AGG AAA AGT CGT TTT CCA GAA CGT TTC ATG 

glu ala pro arg lys ser arg phe pro glu arg phe met 



GAT GAC ATT GCA GCT CTT GTC 

asp asp ile ala ala leu val 



482 512 

AGC ACG ATT GCT AGT GAT ATA GTT TCA CGA TTT CAG AAG GAC ACA GAA ATG GTT GAG AGA

ser thr ile ala ser asp ile val ser arg phe gln lys asp thr glu met val glu arg
542 572 

CTC AAT ACA AGC CTT GCA TTC TTT CTC AAT GAT CTG TTG TCT GTT ATG GAC AGA GGA TTT 

leu asn thr ser leu ala phe phe leu asn asp leu leu ser val met asp arg gly phe
602 632 

GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA CAG GTG TCT TCA AAG CTT TAC TCA TTA CCG

val phe ser leu ile lys ser cys tyr lys gln val ser ser lys leu tyr ser leu pro
662 692

AAT CCC AGT GTT CTG GTG TCC TTG AGG CTG GAT TTT CTA CGA ATC ATC TGC AGT CAT GAG
asn pro ser val leu val ser leu arg leu asp phe leu arg ile ile cys ser his glu

722 752

CAC TAT GTT ACA TTA AAC TTA CCC TGC AGC TTA CTT ACT CCA CCT GCA TCT CCA TCA CCT
his tyr val thr leu asn leu pro cys ser leu leu thr pro pro ala ser pro ser pro

782 812

TCT GTT TCT TCT GCA ACA TCT CAG AGT TCT GGA TTT TCT ACG AAT GTA CAA GAC CAA AAG
ser val ser ser ala thr ser gln ser ser gly phe ser thr asn val gln asp gln lys

842 872

ATT GCA AAT ATG TTT GAA TTA TCC GTG CCT TTC CGC CAA CAG CAT TAT TTG GCA GGA CTT
ile ala asn met phe glu leu ser val pro phe arg gln gln his tyr leu ala gly leu

Cadherin
| xx EC motif xx |
902 932

GTG TTA ACA GAG CTG GCT GTC ATT TTA gac cct gat gct gaa gga ctg TTT GGA TTG CAT
val leu thr glu leu ala val ile leu asp pro asp ala glu gly leu phe gly leu his

962 992

AAG AAA GTC ATC AAT ATG GTA CAC AAT TTA CTC TCC AGT CAC GAC TCA GAC CCG CGG TAC
lys lys val ile asn met val his asn leu leu ser ser his asp ser asp pro arg tyr

1022 1052

TCT GAC CCT CAG ATA AAG GCT CGA GTG GCC ATG TTG TAT CTA CCT CTG ATT GGT ATT ATC
ser asp pro gln ile lys ala arg val ala met leu tyr leu pro leu ile gly ile ile

1082 1112

ATG GAA ACT GTA CCT CAG CTG TAT GAT TTT ACA GAA ACT CAC AAT CAA CGA GGA AGA CCA
met glu thr val pro gln leu tyr asp phe thr glu thr his asn gln arg gly arg pro

1142 1172

ATT TGT ATA GCC ACT GAT GAT TAT GAA AGT GAG AGC GGA AGT ATG ATA AGC CAG ACC GTT
ile cys ile ala thr asp asp tyr glu ser glu ser gly ser met ile ser gln thr val

1202 1232

GCC ATG GCA ATC GCA GGG ACA TCG GTC CCT CAA CTA ACA AGG CCT GGC AGT TTC CTC CTC
ala met ala ile ala gly thr ser val pro gln leu thr arg pro gly ser phe leu leu

1262 1292

ACG TCA ACG AGT GGC AGG CAA CAC ACT ACC TTT TCA GCA GAA TCA AGT CGA AGC CTT TTG
thr ser thr ser gly arg gln his thr thr phe ser ala glu ser ser arg ser leu leu

1322 1352

ATC TGT CTA CTT TGG GTT CTC AAA AAT GCA GAT GAA ACA GTT CTA CAG AAG TGG TTT ACA
ile cys leu leu trp val leu lys asn ala asp glu thr val leu gln lys trp phe thr
1382 1412

GAT CTC TCA GTC TTG CAG CTA AAC CGG CTA TTA GAT CTG CTT TAT CTC TGT GTG TCT TGC
asp leu ser val leu gln leu asn arg leu leu asp leu leu tyr leu cys val ser cys

1442 1472

TTT GAG TAT AAA GGG AAA AAA GTG TTT GAA CGA ATG AAT AGC TTG ACC TTT AAG AAA TCA
phe glu tyr lys gly lys lys val phe glu arg met asn ser leu thr phe lys lys ser

1502 1532

AAA GAC ATG AGA GCA AAG CTT GAA GAA GCT ATT CTT GGG AGC ATA GGT GCC AGG CAA GAA
lys asp met arg ala lys leu glu glu ala ile leu gly ser ile gly ala arg gln glu

1562 1592

ATG GTA CGG CGA AGC CGA GGA CAG CTC GAG AGA AGC CCA TCT GGA AGT GCC TTT GGA AGT
met val arg arg ser arg gly gln leu glu arg ser pro ser gly ser ala phe gly ser

1622 1652

CAA GAA AAT TTG AGG TGG AGG AAA GAT ATG ACT CAC TGG CGT CAA AAC ACA GAG AAG CTT
gln glu asn leu arg trp arg lys asp met thr his trp arg gln asn thr glu lys leu

1682 1712

GAC AAA TCA AGA GCA GAG ATT GAA CAC GAA GCA CTG ATT GAT GGA AAC CTG GCT ACA GAA
asp lys ser arg ala glu ile glu his glu ala leu ile asp gly asn leu ala thr glu



1742 1772

GCN AAC CTA ATC ATT TTA GAT ACA TTA GAG ATT GTT GTT CAG ACC GTT TCT GTA ACG GAA
ala asn leu ile ile leu asp thr leu glu ile val val gln thr val ser val thr glu

1802 1832

TCC AAA GAG AGC ATT CTT GGT GGA GTG CTA AAA GTG CTA CTA CAC AGC ATG GCC TGT AAC
ser lys glu ser ile leu gly gly val leu lys val leu leu his ser met ala cys asn

1862 1892

CAA AGT GCA GTT TAT CTA CAA CAC TGT TTT GCT ACA CAG AGA GCC TTG GTT TCA AAG TTT
gln ser ala val tyr leu gln his cys phe ala thr gln arg ala leu val ser lys phe

1922 1952

CCT GAA CTC TTA TTT GAA GAA GAG ACA GAG CAG TGT GCT GAT TTA TGC CTC AGG CTT CTC
pro glu leu leu phe glu glu glu thr glu gln cys ala asp leu cys leu arg leu leu

*

1982 2012

CGA CAC TGT AGC AGT AGC ATC GGT ACA ATA CGG TCA CAC CCC AGT GCC TCC CTT TAC CTA
arg his cys ser ser ser ile gly thr ile arg ser his pro ser ala ser leu tyr leu

2042 2072

CTA ATG AGG CAA AAC TTT GAG ATT GGG AAT AAC TTT GCC AGG GTT AAA ATG CAG GTA CCA
leu met arg gln asn phe glu ile gly asn asn phe ala arg val lys met gln val pro
2102 2132

ATG TCA CTA TCC TCC TTG GTG GGC ACA TCT CAG AAT TTT AAT GAA GAA TTC TTA AGA CGT 

met ser leu ser ser leu val gly thr ser gln asn phe asn glu glu phe leu arg arg
2162 2192 

TCT CTA AAG ACT ATA TTG ACA TAT GCT GAA GAA GAT CTG GAA TTG AGG GAA ACA ACA TTT
ser leu lys thr ile leu thr tyr ala glu glu asp leu glu leu arg glu thr thr phe 

2222 2252 

CCT GAT CAG GTC CAG GAT CTG GTT TTC AAT CTC CAT ATG ATT CTT TCT GAT ACT GTG AAA 
pro asp gln val gln asp leu val phe asn leu his met ile leu ser asp thr val lys

2282 2312 

ATG AAG GAA CAC CAG GAG GAT CCT GAA ATG TTG ATT GAT CTA ATG tac aga att gcc aag 
met lys glu his gln glu asp pro glu met leu ile asp leu met tyr arg ile ala lys

2342 2372 

ggt tac CAG ACC TCT CCA GAT CTG CGA TTG ACC TGG TTG CAG AAC ATG GCA GGC AAG CAC 

gly tyr gln thr ser pro asp leu arg leu thr trp leu gln asn met ala gly lys his

2402 | xxxxxxxxxxxxxxxx transmembrane domain xxxxxxxxxxxxxxxxxxx

TCA GAA CGA AGC AAT CAT GCT GAA GCT GCA CAG TGT CTA GTC CAC TCA GCA GCA CTT GTT
ser glu arg ser asn his ala glu ala ala gln cys leu val his ser ala ala leu val

xxxxxxxxxxxxxxxxxxxx | 2492

GCT GAA TAT TTG AGC ATG CTG GAG GAC CGG AAA TAT CTT CCT GTG GGA TGT GTA ACA TTT

ala glu tyr leu ser met leu glu asp arg lys tyr leu pro val gly cys val thr phe
2522 2552

CAG AAT ATT TCA TCT AAT GTT TTA GAA GAA TCT GCG GTC TCA GAT GAT GTG GTA TCT CCA
gln asn ile ser ser asn val leu glu glu ser ala val ser asp asp val val ser pro



2582 2612 

GAT GAA GAA GGT ATC TGC TCT GGA AAA TAC TTT ACT GAG TCA GGA CTT GTG GGA TTA CTG 

asp glu glu gly ile cys ser gly lys tyr phe thr glu ser gly leu val gly leu leu 

2642 2672 |xxxxx ITAM xxxx| |xxx

GAA CAA GCA GCT GCT TCC TTC TCT ATG GCT GGC ATG TAT GAA GCA GTT AAT GAA GTT TAC
glu gln ala ala ala ser phe ser met ala gly met tyr glu ala val asn glu val tyr

ITAM xxx| 2732
AAA GTA CTT ATT CCT ATT CAT GAA GCT AAT CGG GAT GCA AAG AAA CTA TCC ACA ATT CAT

lys val leu ile pro ile his glu ala asn arg asp ala lys lys leu ser thr ile his
2762 2792 

GGT AAA CTT CAA GAA GCA TTC AGC AAA ATT GTT CAT CAG AGT ACT GGC TGG GAG CGG ATG 
gly lys leu gln glu ala phe ser lys ile val his gln ser thr gly trp glu arg met
2822 |xxxxx ITAM xxxx| 2852

TTT GGC ACC TAT TTT CGT GTT GGT TTT TAT GGA ACC AAG TTC GGG GAT TTG GAT GAA CAA 

phe gly thr tyr phe arg val gly phe tyr gly thr lys phe gly asp leu asp glu gln
2882 2912 

GAA TTT GTT TAC AAG GAG CCT GCA ATA ACC AAA CTT GCA GAG ATA TCT CAC AGA TTG GAG
glu phe val tyr lys glu pro ala ile thr lys leu ala glu ile ser his arg leu glu 

2942 2972 

GGA TTT TAC GGA GAA AGA* TTT GGA GAG GAT GTG GTT GAA GTA ATC AAA GAC TCT AAT CCT 

gly phe tyr gly glu arg phe gly glu asp val val glu val ile lys asp ser asn pro 
3002 3032 |xxxxx ITAM xxxx|

GTA GAC AAG TGT AAA TTA GAT CCT AAC AAG GCA TAT ATT CAG ATT ACC TAT GTG GAG CCA 

val asp lys cys lys leu asp pro asn lys ala tyr ile gln ile thr tyr val glu pro
3062 3092 

TAC TTT GAC ACA TAT GAG ATG AAG GAC AGA ATC ACC TAT TTC GAC AAA AAT TAC AAT CTT 

tyr phe asp thr tyr glu met lys asp arg ile thr tyr phe asp lys asn tyr asn leu 
3122 3152

CGT CGA TTC ATG TAC TGT ACA CCC TTT ACT TTA GAT GGC CGT GCC CAT GGG GAA CTT CAT

arg arg phe met tyr cys thr pro phe thr leu asp gly arg ala his gly glu leu his

3182 * 3212

GAA CAA TTC AAA AGG AAG ACC ATT CTG ACT ACG TCT CAT GCC TTT CCT TAT ATT AAA ACA
glu gln phe lys arg lys thr ile leu thr thr ser his ala phe pro tyr ile lys thr

3242 3272 |xxxxxxxxxxxxxxx

AGG GTC AAT GTC ACT CAT AAA GAA GAG ATC ATC TTA ACA CCA ATT GAA GTT GCT ATT GAG

arg val asn val thr his lys glu glu ile ile leu thr pro ile glu val ala ile glu

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Coiled coil 1 xxxxxxxxxxxx xxxxxxxxxxx xxxxxxx
GAC ATG CAG AAA AAG ACA CAG GAG TTG GCA TTT GCA ACA CAT CAG GAT CCC GCA GAC CCC

asp met gln lys lys thr gln glu leu ala phe ala thr his gln asp pro ala asp pro

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx |

3392

AAA ATG CTT CAG ATG GTA CTC CAG GGA TCT GTA GGC ACC ACA GTG AAT CAG GGG CCT TTG
lys met leu gln met val leu gln gly ser val gly thr thr val asn gln gly pro leu

3422 3452

GAA GTT GCC CAG GTT TTT CTG TCT GAA ATA CCT AGT GAC CCA AAG CTC TTC AGA CAT CAT
glu val ala gln val phe leu ser glu ile pro ser asp pro lys leu phe arg his his



3482 3512 |xxxxxxxxxxx

AAT AAA CTG CGA CTC TGC TTT AAA GAT TTT ACT AAA AGG TGT GAA GAT GCC TTA AGA AAA 

asn lys leu arg leu cys phe lys asp phe thr lys arg cys glu asp ala leu arg lys
xxxxxxxxxxxxxxxxxxxxxx Coiled coil 2 xxxxxxxxxx xxxxxxxxxxxxx xxxxxxx |

AAT AAG AGC TTA ATT GGG CCG GTT CAA AAG GAG TAT CAA AGG GAA TTG GGG AAA CTA TCT 

asn lys ser leu ile gly pro val gln lys glu tyr gln arg glu leu gly lys leu ser
3602 3632 

TCG CCT TAA AGA GGC CCT ACA GCC CTA GAT CAC AGA AAG TCC CTC AGT TAT CCA AGC CAG 
ser pro STP

3662 3692 

TAT TGC TTG TCC CCT GCC ACA GAG ATT CCT TCA GTC GAA TGA GCT TTC GCA AAA TGG ATC 
3722 3752 

TCT AAA CTG AAT GCA CTT GTT TTA TTC ATC TGC AAA GAG CCA TGT ATT CAA CAT CGA GTG 
3782 3812 

TGA AAA GAT CTA TTG GAA ACC AAC ATG GAA TGG AAT TCT GGA AAT TAT TAT TCA TTG AAG 
3842 3872 

AAT GCA GTG GCC AAG AAA ATA TCA AAT GTA GAT TGT TAA CGC TTG AGA ATC ATG GCT ATG 
3902 3932

0$T TCT AAT GTT CGG GTA ACA AGC TGT TAT CTT TTA AGA CAT TTT AAT GAC TCA AAG GTA 

3962 3992
CAC TAT ACA TTT ACC ATT ATT TAT ACC ATA GCT AAG GTT AAA AAT TTA TTC ACT TTA AGT

4022 4052

TCG TAT TTT TTA ATT TAT ATC ACC ATT TAT AGA TTC ATT TTG GAC CCA TTT TAA ATG TAG
4082 4112

TAA TGC TTA TTT TAA AGG TAC TAA AAA ATA TGT GAA TGT TTA CCT CGT GCG CGC CAG GGC



4142

CTC
HC2A

KIAA AS GNIJDKNARFSAI YRQDSNKLSHDDfc^^ 

rat 

HC4 

HC1 

HC3 

HC5 



HC2A 

KIAA FPNYVNSS Y IPTKQFE T CSKTP I TFEVEEFVPCI PKHT QP YT I YT13HL YVYPKYLKYDSQ 

rat 

HC4 

HC1 

HC3 

HC5 

HC2A VLHHHQNPE FYDE I K 

KIAA KS FAKARNIAI C IE FKDS DEEDS QPLKC I YGRPGGPVFTRSAFAAVLHHHQNPE FYDE I K 

rat 

HC4 

HC1

HC3

HC5

HC2A IELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLIJaXSRVVTSEQHI

KIAA IELP TQLHEKHHLJULTFFHVSCDNSSKGS T1QCRDVVE TQVG YS WLPIIiKDGRVVT SEQH I

rat

HC4

HC1

HC3

HC5 — — — — — — _ — _ — — — — — — — — —



HC2A PVSANLPSGYLGYQELGMGJUJYGPEIKST^^GKPLLKIST^

KIAA PVSANLPSGYJLGYC^GMGSUJYGPEIKWVI^

rat

HC4 

HC1 

HC3 

HC5 



•GPGPARSTVSISLISNSARV 



HC2A QKTESGAQALGNELVKYLKSLHAMEGHVM IAFLPT I LNQLFRVLT-RATQEEVAVNVTRV 

KIAA QKTESGAQALGNELVKYLKSLHAtlEGHVMIAFI^TII^ 

rat 

HC4 MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV

HC1 - MSFLPI I LNQL FKVLV- QNEEDE I TTTVTRV 

HC3 NRSRSLSNSNPDISGTPTSPDDEVRSI IGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 

HC5 
KIAA
rat 

HC4 
HC1 
HC3 
HC5 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



1 1 HWAQCHEE GLES HLRS YVKYAYKAE PYVASEYKTVHEE LTKSMT T I LKP SADFLT S N 
1 1 HWAQCHEE GLE S KLRS YVKYAYKAEP YVASE YX TVBEEL T KSM T T I LKP S ADFL T SN 

LFHIVSKCHEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETLAT^ 

LPDI VAKCHEEQLDHSVQS Y I KFVFKTR ACKERPVHEDLAKNVTGLLK-SNDSPTVK 

TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKL FHEELALQ WW CSG — SVR E 

Cadherin 
Cleavage 



KLLRYSW FFFDVL IKSMAQHL I ENSKVKL1 RNQF FPASYHHAAETWNMLMPHITQKFGD 
KLLKYS WFFFDVL IKSMAQHL I ENSKVKLI RNQF FPAS YHHAVE TWNMLMPH I TQK FRD 



KLLKYSWFFFEI IAKSMATYLLEENKIKLIlHGQ^FPKAYHHALHSLFLAIT- IVESQYAE 
HVLKHS W FFFAI I LKSMAQHL I DTNKIQLP RPQF FPESYQNELDNLVMVLSDHVIWKYKD 
SALQQAW FFFELWVKSMVHHL YFNDKLEAE RKSP FPERFMDDIAALVSTIASDIVSRFQK 



HC2A 
KIAA 

rat 
HC4 
HC1 
HC3 
HC5 



NPEAS KNANHS LAVF I KRC FT FMDRG FVFKQ I N NYIS — CFAPGDPKTLFEYKFEFL 

KPEASKNANHSIAVFIKRCFTFMDRGFVFKQIN NYIS — CFAPGDPKTLFEYKFEFL 

IPKESRNVNYSIJ^FLKCCLTIWDRGFVFNLIN DYIS — GFSPKDPKVLAEYKFE FL 

ALEETRRATHSVARFLKRCFTFMDRGCVFKMVN NYIS — MFSS GDLKTLCQYKFDFL 

DTEMVERLNTS LAFFLNDLLSVMDRG FVFS L I KS C YKQVSSKL YSLPNPS VLVS LRLD FL 



HC2A

KIAA

rat

HC4

HC1

I fin*. 



HC2A
KIAA
rat
HC4
HC1
HC3
HC5 



RWCNHEHYI PLNLPM- 
RWCNHEHYI PLNLPM* 



-PFGKGRIQR- 
\PFGKGRIQR- 



YQDLQL DYSLTDEF 

YQDLQL DYSLTDEF 



QT I CNHEHY I PLNLPM AFAKPKLQR VQDSNL EYSLSDEY 

QEVCQHEHFI PLCLPIRSANI PDPLTPSES TQELHASDMPEYSVTNEF 

R 1 1 CSHEHYVTLNLPGSLLTPFASPS PSVS SATS QSS GFS TNVQDQK I ANMFELS — VPF 
MNADTAPTS PCPS I S SQSSSSCSS FQDQKIASMFDRTSRVPA 



Cadherin 
EC motif 



CRNHFLVGL 
CRNHFLVGI 


LLRE 
LLRE 


VGTALQEFRE VRL IAI S VLKNLL I KHS FDDRYAS RSHQAR I AT 

VGTALQEFRE VRLIAISVLKNLLIKHS FDDRYAS RSHQARIAT 


CKHHFLVGL 
CRKHFLIGI 
RQQHYLAGL 
SSTS-SPGI 


LLRE 
LLRE 
VLTE 
LFTE 


TS IALQDNYE IRYTAISVI KNLL I KHAFDTRYQHKNQQAK I AQ 

VGFALQEDQD VRHLALAVLKNLMAKHS FDDR YRE PRKQ AQ I AS 

LAVIII)PDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
LAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 



HC2A 

KIAA 

rat 

HC4 

HC1 

HC3 

HC5 



LYLPL FGLL I ENVQR INVRDVS PFPVNAG-MTVKDES LALPAVNPLVTPQKGS TLDNS LH 
LYLPLFGIJjIENVQRJLNVRDVSPFPVNAG-H TLDNSLH 

LYLPFVGLLLENIQRIiAGRDTLYSCAAMPNSASRDEFPCG FTSP — AN — RGSLS 

LYMPLYGMLLDNMPRIYLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFS 

LYLPL I GI IMETVPQLYDFTETHNQRGRPI C IATDDYESE SG SMIS 

LYLPLVGI ILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 
HC2A KDLLGAISGIASPYTTSTPN I NSVRNADSRGS L I S TDSGNS LPERNSEKSNSLDKHQQS S

' KIAA T KDI^GAISGJJ^PYTTSTPNINSVRNAD^ 

rat 

HC4 TDKDTAYGS FQNG HGIKREDSRGSLI P-EGATGFPDQGNTGEN TRQS 

HC 1 KDVLNS IAAFS S IAI S TVNHADSRAS LASLDSNPSTNEKSSEKTDNCEKI PRPL 

HC3 QTVAMAIAGTSVPQ LTRPGSFLLTSTSGRQHT 

HC5 QNVALAIAGNNFN LKTSG- IVLSSLPYKQYN 



HC 2 A TLGNS WRCDKLDQSE I KSLLMC FLY I LKSMS DDALFT YWN-KAS TSELMD FFT I SE VCL 

KIAA TLGNS VVRCDKIJX3SEIKSIJLMCFLYIIJCSMS TSELMDFFT I SEVCL 

3^3, t _ — — — ————————— — — — — — _—_———_————.—— — —————— — — — — — — 

HC4 STRSSVSQYNRLDQYEIRSLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 

HC1 AL I GS TLRFDRLDQAE TRS LLMC FLH IMKT I S YE TLI AYWQ-RAPS PEVSD FFS I LDVCL 

HC3 T FSAESS RS LL I CLLWVLKN-ADE TVLQKW FTDLS VLQLNRLLDLLYLCV 

HC5 MLNADTTRNLM I C FLW IMKN-ADQS L I RKW I ADLPS TQLNR I LDLL F I CV 



HC2A HQFQYMGKRYIARNQEGLG — PIVHDRKS QTLPVSRNRTGMM 

KIAA HQFQ YMGKRYIAR TGMM 

rat 

HC 4 FHraYMGKRNIARVHDAWLSKHFG I DRKS QTMPALRNRS GVM 

HC1 QNFRYLGKRNI IRKIAAAF — KFVQS T QNNGT LKGSNPS CQ7S GLLAQWMHS TSRHE GHK 

HC3 SCFEYKGKKVFERMNSLTFK — KSKDMRAK LEEAILGS I GARQEMV 

HC5 LCFEYKGKQSSDKVSTQVLQ — KSRDVKAR LEEALLRGEGARGEMM 

HC2A HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC

KIAA HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC

rat

HC4 QARLQHL SSLESS FTLNHSSTTTEADI FHQALLEGNTATEVS

HC1 QHRSQTLPI IRGK NALSNPKL LQMLDNTMTSNSNE IDIVHHVDTEANIATEGC

HC3 RRSRGQLERS PSGSAFGS QENLRWRKDMTHWRQNTEKLDKSRAE I EHEAL I DGNLATEAN

HC5 RRRAPGNDRFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALISGNIATEAH

HC2A LTALDTI^LFTIAFKNQLLADHGHNPLMKKVFDVYLCF

KIAA LTALDTLSLFTLAFKNQLLADHGHNPIjMKK^

rat KLSRGHS PLMKKVFDVYLCFLQKHQSEMALKNVFTALRSL I Y

HC4 LTVLDTISFFTQCFKTHFLNNDGHNPLMKKVTO 

HC 1 LTILDLVSLFTQTHQRQLQQCDCQNSLMKRGFDTYMLFFQVNQSATAL^^ 

HC3 LIILDTLEIWQTVS — VTES — KES I LGGVLKVLLHSMACNQSAVYLQHCFAT QRALVS

HC5 LI ILDMQENI IQASS — ALDC — KDSLLGGVLRVLVNSLNCDQSTTYLTHCFATLRALLA

HC2A KFPS T FYEGRABMCAALCYE I LKCCNSKLSS IRTEAS QLLYFLMRNNFDYTGKKS FVRTH 

KIAA KFPSTFYEGRAEWCAALCYEILKCCMSKLSSIRT^ 

rat KFPSTFYEGRAIMCASLCYEVLKCCNSKLSS IRTEASQLLYFI^^ 

HC 4 KFPSAPFKGRVNMCAAFCYEVLKCCTSKISSTRNEASALLYLIJfRNNFEYTKRK FLRTH 

HC 1 KFPSAFFQGPADLCGS FCYEVLKCCNHRSRSTQTEASALLYLFMRKNFEFNKQKS IVRSH 

HC3 KFPELL FEEETEQCADLCLRL LRHCSSS I GT IRSHPSASLYLLMRQNFEIGN — NFARVK 

HC5 KFGDLLFEEEVE QCFDLCHQVLHHCS S SMDVTRS QACATL YLLMRFS FGATS - -NFARVK 

HC2A LQV 1 1 S VS QL I ADWG I GE TR FQQS LS 1 1 NNCAN S DRL I KH T S FS S DVKDLTKR I RTVLM 

KIAA LQVI I SVS QLIADWGI GGTRFQQSLS I I14NCANSDRLIKHTSFSSDVKDLTKIIIRTVLM 

rat LQVIISLSQLIADWGIGGTRFQQSLSI INNCANSDRLIKHTSFSSDVKDLTKRIRTVIH 

HC4 LQI IIAVSQLIADVALSGGSRFQESLFI I NN FANS DRPMLARAFPAE VKDL TKR I RTVLM 

HC1 LQL IKAVSQL IAD-AG I GGSRFQHS LAI TNNKANGDKQMKNSNFPAEVKDLTKR IRTVLM 

HC3 MQVPMSI^SLVGTSQNFNEEFIJIRSLKTILTYAEEDI^LRETTFPDQVQDLVFNLHMII^ 

HC5 MQVTMSLASLVGRAPDFNEEHLRRSLRT I LAYSEEDTAMQMTPFPTQVEELLCNLNS I LY 
■VHFNEDVLMEIJjEQCADGLWKAE H

-VHFNEDVLMELLEQCADGLWKAE RLRAG LLTSINSSS P 
-VHYSEEVLLELLEQCVNGLWKAEHYEI n}SE I SKLI GP I 
-TPYNEN ILVE QLYMCGEFLWK^E HYEL 1I ADVNKP I 1AV 



I DEEASMMEDVGMQD 

I DEEASMMEDVGMQD 

I DEEASMMEDVGMQD 

I DEEGAMKE DAGMMD 

IKEEGAAKEDSGMHD 

VLEESAVSDDW5 PDEEGI CSGKYFTES GLVGLLEQAAAS FSMAGMYEAVNE VYKVL I PI 
VLEESWSEDTLSPDEDGVCAGQYFTESGLVGLI^QAAELFSTGGLYETVMT^-^^IPI 
YEKRRD-

YEKRRDFERIAHliYDTli^^ 



SMKSGGTLETTHL YDT1 HRF YSK\ TEVI TR A AGS TOLL PGGLFGQ 

YENRRE FENLTQV YRT3 ,HGA YTK1 LEVMHTKKRLLG TFFRV AFYGQ 

fekqrdfkklsdi^y^hrs }ylkm aevvnsekrlfg FjYYRMAFYGQ 

1yfrmgfyg- 
tJyfrsjgffg- 



HEANRDAKKLSTIHGKLQEAFSKIVHQSTGWERMFG- 
LEAHREFRKLTLTHSKLQRAFDS IVNKDH — KRMFG- 



ITAM ITAM
-FFEDEDGKEjYTTKElPKXTPLSEI^ 

GFFEDEDGKE YI YKE PKLTPLSEISQRLLKL YSD* FGSENVKMIQDSGKVNPKDLDSK YA 
GFFEDEDGKE YIYKE PKLTPLSEISQRLLKL YSDF FGSENVKMIQDSGKVNPKDLDSF FA 
SFFEEEDGKE YIYKE PKLTGLSE I SLRLVKI YGEF FGTENVKI IQDSDKVNAKELDP* YA 
GFFEEEEGKE YIYKE PKLTGLSEI SQRLLKL YADB FGADNVKI I QPSNKVjN PKDLDPHYA 
TKFGDLDEQS FVYKE PAITKLAEISHRLEG5 YGEP FGEDWEVIKDSNPVDKCKLDPNKA 
SKFGDLDEQ5 FVYKE PAI TKLPE I SHRLEAF YGQC FGAEFVEVIKDSTPVDKTKLDPNKA 



ITAM

YlbVTHVI P FFDEKELQERKTE FERS HN I RRFMFEMP FTQT GKRQGGVEE QCKRRT I LTA 
YI3VTHVIPFFDEKELQERKTEFERSHNIRREMFEMPFTQTGKRQGGVEEQCKRRTILTA 
YI 2VTH\rePFFDEKEI^ERKTEFERCHNIRRFMFE^ 

HIDVT YVKPYFDDKELTERKTEFERNHNISRFVFEAPYTLSGKKQGCIEEQCKRRT^ 
YI QTflTYV T V 1;' t'EEKEIEDRKTDFEMHHNIKRFV FET P FT LSGKKHGGVAEQCKRRTILTT 
YIQITYVEPYFDTYEMKDRITYFDKNYNLRRFMYCTPFTLDGRAHGELHEQF 
YIQITFVEPYFDEYEMKDRVTYFEKNFKLRREHYTTPFTLEGRPRGELHEQYRRNTVLTT 
I HCFPYVKKRI PVMYQHHTDLNE I EVAI DEMS KKVAELHQLCS SAEVDM I KLQLKLQC SV
SNS FP YVKKR I P INCEQQINLKP I DGATDE I KDKTAELQKLCS S TDVDMI QLQLKLQC W 
SHLFPYVKKRIQVISQSSTELNF IEVAIDEMSRKVSELNQLCTMEEVDMISLQLKLQC SV 
SHAFPYIKTRVNVTHKEEIILTP I EVAIEDMQKKT QELAFATHQDPADPKMLQMVLQC SV 
MHAFPYIKTRISVIQKEEFVLTP IEVAIEDMKKKTLQ3AVAINQEPPDAKML<:^^ 



Coiled-Coil 2 
GATVNQGPLEVAQVFLAE IPADPKLYRHHNKLRLCFKE FIMRCGE qVEKHKRLITADQRE

Coiled-Coil 2 

YQEEMKAOT^^ 

YQEEMKANYREMAKELSEIMHE QLG 

YQEEMKANYREIRKELSDI I VI RICPGEDKRATKFPAHLQRHQRDTNKHSGSRVDQFILS 
YHEGLKSNFRDMVKELSDI I HE QI LQEDTMHS PWMSNTLHVFCAI S GTS S DRGYGSPRYA 

YQEELRSHYKDMLSELS TVMNffi QI TGRDDLSK RGVDQTCTRVI SKAT PALPTVS I S S 
1

A 

2 32 

CGA GTA AAT CGT TCT CGA AGC CTT AGT AAT AGC AAT CCA GAT ATA TCT GGG ACT CCC ACG 
62 92 

TCA CCA GAT GAT GAA GTT CGA TCA ATC ATC GGG AGT AAG GGT TTA GAT CGC TCC AAT TCC 

* 

122 152

TGG GTT AAC ACT GGT GGT CCA AAA GCT GCC CCA TGG GGA TCC AAC CCC AGT CCA AGT GCA 
182 212 

GAA TCA ACA CAG GCT ATG GAT CGA AGT TGT AAT CGT ATG TCT TCG CAC ACA GAG ACG TCA 

met asp arg ser cys asn arg met ser ser his thr glu thr ser

242 272 

AGT TTC TTA CAA ACA TTA ACG GGA CGC TTA CCA ACT AAA AAG CTT TTT CAC GAG GAG CTG 
ser phe leu gln thr leu thr gly arg leu pro thr lys lys leu phe his glu glu leu

302 332 

GCT TTG CAG TGG GTT GTT TGC AGT GGC AGC GTT CGG GAA TCA GCT TTG CAA CAA GCC TGG
ala leu gln trp val val cys ser gly ser val arg glu ser ala leu gln gln ala trp

ref 1.1 and 1.2 I
362 ref 2.1 and 2.2 1 392

TTC TTT TTT GAA TTA ATG GTA AAG AGC ATG GTG CAC CAT TTA TAC TTT AAT GAT AAA CTT
phe phe phe glu leu met val lys ser met val his his leu tyr phe asn asp lys leu



422 452

GAG GCT CCA AGG AAA AGT CGT TTT CCA GAA CGT TTC ATG GAT GAC ATT GCA GCT CTT GTC
glu ala pro arg lys ser arg phe pro glu arg phe met asp asp ile ala ala leu val



482 512

AGC ACG ATT GCT AGT GAT ATA GTT TCA CGA TTT CAG AAG GAC ACA GAA ATG GTT GAG AGA
ser thr ile ala ser asp ile val ser arg phe gln lys asp thr glu met val glu arg



542 572
CTC AAT ACA AGC CTT GCA TTC TTT CTC AAT GAT CTG TTG TCT GTT ATG GAC AGA GGA TTT
leu asn thr ser leu ala phe phe leu asn asp leu leu ser val met asp arg gly phe

602 632 

GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA CAG GTG TCT TCA AAG CTT TAC TCA TTA CCG 
val phe ser leu ile lys ser cys tyr lys gln val ser ser lys leu tyr ser leu pro

J ref 3.1 and 3.2
692
AAT CCC AGT GTT CTG GTG TCC TTG AGG CTG GAT TTT CTA CGA ATC ATC TGC AGT CAT GAG

asn pro ser val leu val ser leu arg leu asp phe leu arg ile ile cys ser his glu
722 752 

CAC TAT GTT ACA TTA AAC TTA CCC TGC AGC TTA CTT ACT CCA CCT GCA TCT CCA TCA CCT
his tyr val thr leu asn leu pro cys ser leu leu thr pro pro ala ser pro ser pro 

J ref 4.1 and 4.2 
812 

TCT GTT TCT TCT GCA ACA TCT CAG AGT TCT GGA TTT TCT ACG AAT GTA CAA GAC CAA AAG 
ser val ser ser ala thr ser gln ser ser gly phe ser thr asn val gln asp gln lys
842 872

ATT GCA AAT ATG TTT GAA TTA TCC GTG CCT TTC CGC CAA CAG CAT TAT TTG GCA GGA CTT 

ile ala asn met phe glu leu ser val pro phe arg gln gln his tyr leu ala gly leu
902 932 

GTG TTA ACA GAG CTG GCT GTC ATT TTA gac cct gat gct gaa gga ctg TTT GGA TTG CAT
val leu thr glu leu ala val ile leu asp pro asp ala glu gly leu phe gly leu his 

962 992

AAG AAA GTC ATC AAT ATG GTA CAC AAT TTA CTC TCC AGT CAC GAC TCA GAC CCG CGG TAC
lys lys val ile asn met val his asn leu leu ser ser his asp ser asp pro arg tyr

1022 1052 

TCT GAC CCT CAG ATA AAG GCT CGA GTG GCC ATG TTG TAT CTA CCT CTG ATT GGT ATT ATC 
ser asp pro gln ile lys ala arg val ala met leu tyr leu pro leu ile gly ile ile

1082 1112 

ATG GAA ACT GTA CCT CAG CTG TAT GAT TTT ACA GAA ACT CAC AAT CAA CGA GGA AGA CCA 
met glu thr val pro gln leu tyr asp phe thr glu thr his asn gln arg gly arg pro

1142 1172

ATT TGT ATA GCC ACT GAT GAT TAT GAA AGT GAG AGC GGA AGT ATG ATA AGC CAG ACC GTT
ile cys ile ala thr asp asp tyr glu ser glu ser gly ser met ile ser gln thr val

1202 1232

GCC ATG GCA ATC GCA GGG ACA TCG GTC CCT CAA CTA ACA AGG CCT GGC AGT TTC CTC CTC
ala met ala ile ala gly thr ser val pro gln leu thr arg pro gly ser phe leu leu
| ref 5.1 and 5.2

1262 A 1292

ACG TCA ACG AGT GGC AGG CAA CAC ACT ACC TTT TCA GCA GAA TCA AGT CGA AGC CTT TTG
thr ser thr ser gly arg gln his thr thr phe ser ala glu ser ser arg ser leu leu

1322 1352

ATC TGT CTA CTT TGG GTT CTC AAA AAT GCA GAT GAA ACA GTT CTA CAG AAG TGG TTT ACA
ile cys leu leu trp val leu lys asn ala asp glu thr val leu gln lys trp phe thr

1382 1412

GAT CTC TCA GTC TTG CAG CTA AAC CGG CTA TTA GAT CTG CTT TAT CTC TGT GTG TCT TGC
asp leu ser val leu gln leu asn arg leu leu asp leu leu tyr leu cys val ser cys

1442 1472 

TTT GAG TAT AAA GGG AAA AAA GTG TTT GAA CGA ATG AAT AGC TTG ACC TTT AAG AAA TCA 
phe glu tyr lys gly lys lys val phe glu arg met asn ser leu thr phe lys lys ser 

1502 1532

AAA GAC ATG AGA GCA AAG CTT GAA GAA GCT ATT CTT GGG AGC ATA GGT GCC AGG CAA GAA 

lys asp met arg ala lys leu glu glu ala ile leu gly ser ile gly ala arg gln glu

■ref 6.1 and 6.2 
1562 J 1592 

ATG GTA CGG CGA AGC CGA GGA CAG CTC GAG AGA AGC CCA TCT GGA AGT GCC TTT GGA AGT 

met val arg arg ser arg gly gln leu glu arg ser pro ser gly ser ala phe gly ser
1622 1652 

CAA GAA AAT TTG AGG TGG AGG AAA GAT ATG ACT CAC TGG CGT CAA AAC ACA GAG AAG CTT 
gln glu asn leu arg trp arg lys asp met thr his trp arg gln asn thr glu lys leu



A 
1682 1712

GAC AAA TCA AGA GCA GAG* ATT GAA CAC GAA GCA CTG ATT GAT GGA AAC CTG GCT ACA GAA 

asp lys ser arg ala glu ile glu his glu ala leu ile asp gly asn leu ala thr glu* 
1742 1772 

GCN AAC CTA ATC ATT TTA GAT ACA TTA GAG ATT GTT GTT CAG ACC GTT TCT GTA ACG GAA

ala asn leu ile ile leu asp thr leu glu ile val val gln thr val ser val thr glu

1802 1832

TCC AAA GAG AGC ATT CTT GGT GGA GTG CTA AAA GTG CTA CTA CAC AGC ATG GCC TGT AAC
ser lys glu ser ile leu gly gly val leu lys val leu leu his ser met ala cys asn

ref 7.1 and 7.2 

1862 1892 

CAA AGT GCA GTT TAT CTA CAA CAC TGT TTT GCT ACA CAG AGA GCC TTG GTT* TCA AAG TTT 
gln ser ala val tyr leu gln his cys phe ala thr gln arg ala leu val ser lys phe



1922 * 1952 

CCT GAA CTC TTA TTT GAA GAA GAG ACA GAG CAG TGT GCT GAT TTA TGC CTC AGG CTT CTC 
pro glu leu leu phe glu glu glu thr glu gln cys ala asp leu cys leu arg leu leu

1982 2012

CGA CAC TGT AGC AGT AGC ATC GGT ACA ATA CGG TCA CAC CCC AGT GCC TCC CTT TAC CTA
arg his cys ser ser ser ile gly thr ile arg ser his pro ser ala ser leu tyr leu

2042 2072

CTA ATG AGG CAA AAC TTT GAG ATT GGG AAT AAC TTT GCC AGG GTT AAA ATG CAG GTA CCA
leu met arg gln asn phe glu ile gly asn asn phe ala arg val lys met gln val pro



2102 2132

ATG TCA CTA TCC TCC TTG GTG GGC ACA TCT CAG AAT TTT AAT GAA GAA TTC TTA AGA CGT

met ser leu ser ser leu val gly thr ser gln asn phe asn glu glu phe leu arg arg

2162 2192

TCT CTA AAG ACT ATA TTG ACA TAT GCT GAA GAA GAT CTG GAA TTG AGG GAA ACA ACA TTT
ser leu lys thr ile leu thr tyr ala glu glu asp leu glu leu arg glu thr thr phe 



2222 2252
CCT GAT CAG GTC CAG GAT CTG GTT TTC AAT CTC CAT ATG ATT CTT TCT GAT ACT GTG AAA
pro asp gln val gln asp leu val phe asn leu his met ile leu ser asp thr val lys

2282 2312 

ATG AAG GAA CAC CAG GAG GAT CCT GAA ATG TTG ATT GAT CTA ATG tac aga att gcc aag 
met lys glu his gln glu asp pro glu met leu ile asp leu met tyr arg ile ala lys *

2342 2372 

ggt tac CAG ACC TCT CCA GAT CTG CGA TTG ACC TGG TTG CAG AAC ATG GCA GGC AAG CAC 
gly tyr gln thr ser pro asp leu arg leu thr trp leu gln asn met ala gly lys his

2402 2432 

TCA GAA CGA AGC AAT CAT GCT GAA GCT GCA CAG TGT CTA GTC CAC TCA GCA GCA CTT GTT 
ser glu arg ser asn his ala glu ala ala gln cys leu val his ser ala ala leu val

2462 2492 

GCT GAA TAT TTG AGC ATG CTG GAG GAC CGG AAA TAT * CTT CCT GTG GGA TGT GTA ACA TTT 
ala glu tyr leu ser met leu glu asp arg lys tyr leu pro val gly cys val thr phe 
I ref 8.1 and 8.2

■ 2552 

CAG AAT ATT TCA TCT AAT GTT TTA GAA GAA TCT GCG GTC TCA GAT GAT GTG GTA TCT CCA 

gln asn ile ser ser asn val leu glu glu ser ala val ser asp asp val val ser pro

2612

GAT GAA GAA GGT ATC TGC TCT GGA AAA TAC TTT ACT GAG TCA GGA CTT GTG GGA TTA CTG
asp glu glu gly ile cys ser gly lys tyr phe thr glu ser gly leu val gly leu leu

2672

GAA CAA GCA GCT GCT TCC TTC TCT ATG GCT GGC ATG TAT GAA GCA GTT AAT GAA GTT TAC
glu gln ala ala ala ser phe ser met ala gly met tyr glu ala val asn glu val tyr

2702 2732 

AAA GTA CTT ATT CCT ATT CAT GAA GCT AAT CGG GAT GCA AAG AAA CTA TCC ACA ATT CAT 
lys val leu ile pro ile his glu ala asn arg asp ala lys lys leu ser thr ile his

Iref 9.1 

2762 2792 * 

GGT AAA CTT CAA GAA GCA TTC AGC AAA ATT GTT CAT CAG AGT ACT GGC TGG GAG CGG ATG 

gly lys leu gln glu ala phe ser lys ile val his gln ser thr gly trp glu arg met

*

2822 2852

TTT GGC ACC TAT TTT CGT GTT GGT TTT TAT GGA ACC AAG TTC GGG GAT TTG GAT GAA CAA
phe gly thr tyr phe arg val gly phe tyr gly thr lys phe gly asp leu asp glu gln



2882 2912 

GAA TTT GTT TAC AAG GAG CCT GCA ATA ACC AAA CTT GCA GAG ATA TCT CAC AGA TTG GAG

glu phe val tyr lys glu pro ala ile thr lys leu ala glu ile ser his arg leu glu
▼ 2942 2972

*GGA TTT TAC GGA GAA AGA TTT GGA GAG GAT GTG GTT GAA GTA ATC AAA GAC TCT AAT CCT
gly phe tyr gly glu arg phe gly glu asp val val glu val ile lys asp ser asn pro

3002 3032

GTA GAC AAG TGT AAA TTA GAT CCT AAC AAG GCA TAT ATT CAG ATT ACC TAT GTG GAG CCA
val asp lys cys lys leu asp pro asn lys ala tyr ile gln ile thr tyr val glu pro

3062 3092
TAC TTT GAC ACA TAT GAG ATG AAG GAC AGA ATC ACC TAT TTC GAC AAA AAT TAC AAT CTT 
tyr phe asp thr tyr glu met lys asp arg ile thr tyr phe asp lys asn tyr asn leu 

3122 3152 

CGT CGA TTC ATG TAC TGT ACA CCC TTT ACT TTA GAT GGC CGT GCC CAT GGG GAA CTT CAT 
arg arg phe met tyr cys thr pro phe thr leu asp gly arg ala his gly glu leu his 

3182 3212 

GAA CAA TTC AAA AGG AAG ACC ATT CTG ACT ACG TCT CAT GCC TTT CCT TAT ATT AAA ACA 
glu gln phe lys arg lys thr ile leu thr thr ser his ala phe pro tyr ile lys thr

■ ref 11.1

3242 & 3272 

AGG GTC AAT GTC ACT CAT AAA GAA GAG ATC ATC TTA ACA CCA ATT GAA GTT GCT ATT GAG 
arg val asn val thr his lys glu glu ile ile leu thr pro ile glu val ala ile glu
3332

GAC ATG CAG AAA AAG ACA CAG GAG TTG GCA TTT GCA ACA CAT CAG GAT CCC GCA GAC CCC
asp met gln lys lys thr gln glu leu ala phe ala thr his gln asp pro ala asp pro

3392 

AAA ATG CTT CAG ATG GTA CTC CAG GGA TCT GTA GGC ACC ACA GTG AAT CAG GGG CCT TTG

lys met leu gln met val leu gln gly ser val gly thr thr val asn gln gly pro leu

■ 3422 3452
GAA GTT GCC CAG GTT TTT CTG TCT GAA ATA CCT AGT GAC CCA AAG CTC TTC AGA CAT CAT
glu val ala gln val phe leu ser glu ile pro ser asp pro lys leu phe arg his his

3482 3512 

AAT AAA CTG CGA CTC TGC TTT AAA GAT TTT ACT AAA AGG TGT GAA GAT GCC TTA AGA AAA 
asn lys leu arg leu cys phe lys asp phe thr lys arg cys glu asp ala leu arg lys 

3542 3572

AAT AAG AGC TTA ATT GGG CCG GTT CAA AAG GAG TAT CAA AGG GAA TTG GGG AAA CTA TCT
asn lys ser leu ile gly pro val gln lys glu tyr gln arg glu leu gly lys leu ser



3602 3632 

TCG CCT TAA AGA GGC CCT ACA GCC CTA GAT CAC AGA AAG TCC CTC AGT TAT CCA AGC CAG

ser pro OCH

3692

TAT TGC TTG TCC CCT GCC ACA GAG ATT CCT TCA GTC GAA TGA GCT TTC GCA AAA TGG ATC



3752 

TCT AAA CTG AAT GCA CTT GTT TTA TTC ATC TGC AAA GAG CCA TGT ATT CAA CAT CGA GTG

3812

TGA AAA GAT CTA TTG GAA ACC AAC ATG GAA TGG AAT TCT GGA AAT TAT TAT TCA TTG AAG



3842 3872

AAT GCA GTG GCC AAG AAA ATA TCA AAT GTA GAT TGT TAA CGC TTG AGA ATC ATG GCT ATG


3902 3932

GTT TCT AAT GTT CGG GTA ACA AGC TGT TAT CTT TTA AGA CAT TTT AAT GAC TCA AAG GTA

3992 

CAC TAT ACA TTT ACC ATT ATT TAT ACC ATA GCT AAG GTT AAA AAT TTA TTC ACT TTA AGT
4022 4052 

•pre TAT TTT TTA ATT TAT ATC ACC ATT TAT AGA TTC ATT TTG GAC CCA TTT TAA ATG TAG 

4ref 12,1 

TAA TGC TTA TTT TAA AGG TAC TAA AAA ATA TGT GAA TGT TTA CCT CGT GCG CGC CAG GGC 



4142 
CTC 
Ref 1.1

Sequence of BAC8 using primer C3S3, which spans nucleotides 340-359 of the cDNA. Exon 
sequence is underlined and represents nucleotides 364-380. 
TTTTTTGAATTAATGGT GAGCAAAAACTGAGCATGTC 

AA C A ATTTT ATGCT A GCTC ATTTGTT A CCTT A G AAATCTTTTTCTGTTGCACATCTT AA C 
GCTTTTCCATGTGCCTCTAAGACAAAATTACATGTGTTACATCTCTAAATAAACACTGT 
GGACACTCAACACAGTTTAGGTGGAATTAAGAGTGAGGCTCATTTTAACT 
TCAGGGATGGTTCK^ATAAGCTAGCTATATTTTCAAAGGAA^ 

CTAGTCATTATACATGAAGTGTATAATGACAGTATTGTAGATTTTATACCAAAGATGG 
AAAGAGCTTTATAGATACCCACTGCTATTGTTATGGCTAGTAAACCCTTAGGGAAATG 
CCAGTTACAATCAATAAAAAAACAACAGTCTGGCTGGGTGCAGTGGCTCACACCTGTA 

AJCTCAGCACTITAGAAGGCCGAGGCAGGAGGATCACTTGAGATCAGGAGTTTGAGAC 

ilfcAGCGrGGGCAACATAGCAAGAGCCCATATOTACCCAAAAAAAAT^^ 

*ilAAGCT A AAA CCCTGG NN GGCCACAAAACCTGT AGTTCCCATCTA CTTTGG A AAG GCT 

^AAGGANGGGAGGGCTTGCTTTGAGCCCCAAGAANGTTCAAAGGCTNGCNGNCAGG 

I^OTGATTCNACACKTGCAACTCCCGCATTGGGTNAACAAAANCCAAGGAANC. 
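The captions locate the exon portion of each read by comparing the genomic read against the cDNA: the read follows the cDNA until the exon ends and then diverges into intron. A minimal sketch of that comparison is given below. The read is the first line of the Ref 1.1 sequence and the cDNA fragment is copied from the translated cDNA listing in this document; the function name `shared_prefix_length` and the choice to start both strings at the same base are assumptions made for illustration.

```python
def shared_prefix_length(read, cdna):
    """Count leading bases shared by a genomic read and a cDNA fragment.

    Whitespace is ignored and comparison is case-insensitive. The count stops
    at the first mismatch, which is where the read leaves the exon.
    """
    read_bases = "".join(read.split()).upper()
    cdna_bases = "".join(cdna.split()).upper()
    length = 0
    for read_base, cdna_base in zip(read_bases, cdna_bases):
        if read_base != cdna_base:
            break
        length += 1
    return length


# First line of the Ref 1.1 read (BAC8, primer C3S3).
BAC8_READ = "TTTTTTGAATTAATGGT GAGCAAAAACTGAGCATGTC"
# cDNA fragment aligned to the same starting base (alignment is assumed).
CDNA_FRAGMENT = "TTT TTT GAA TTA ATG GTA AAG AGC ATG GTG"

exon_bases = shared_prefix_length(BAC8_READ, CDNA_FRAGMENT)
print(exon_bases, "bases match the cDNA before the read diverges")
```

For these two strings the shared run is 17 bases, which is the same length as the interval 364-380 given in the Ref 1.1 caption.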

Ref 1.2

Sequence of BAC9 using primer C3S3, which spans nucleotides 340-359 of the cDNA. Exon
sequence is underlined and represents nucleotides 371-380.

hAATTAATGGT GAGCAAAAACTGAGCATGTTCTTTAATATTTTTTCTCTTAGTGAAC 
JjJrrTATGCTAGCTCATTTGTTACCTTAGAAATCl ITJTCTGTTG^CATCTTAACGCTnT 
JjfcCATGTGCCTCTAAGACAAAATTACATGTGTT^ 
CTCAACACAGTTTAGGTGGAATTAAGAGTGAGGCTCATTTTAACTCT 
GATGGTTGCATAAGCTAGCTATATTTTCAAAGGAAACTTGTGATACATTCTTTGCTAGT 
CATTATACATGAAGTGTATAATGACAGTATTGTAGATTTTATACCAAAGATGGAAAGA 
GCITTATAGATACCCACTGCTATTGTTOTGGCTAGTAAACCCTTANGGAAATGCCAGTT 
NCAATCAATAAAAAAACAACAGTACTGGCTGGGTGCAGTGGCTTACACCTGTAATCTC 
AGCACTTTATAAGGCCCNAGGCNGGAGGATCACTTNAGATCC^CHjAGTTTGAGACCAG 
CCTGGGCAACATANCAAGAGCCCATTATCTACCAAAAAAN ri'rri'rri'ri'AAAATTAAG 
CTAAACNCTGGGTGGNACAAACCTGTTNGNTTCCNATOTNCCTTTGCAAAAGCTTANG 
AAGGGGAGGGCTTNCTTTGGANCCCCAAAAAGTTNAAAG<3GNTTGCAGTCAGCC111'1 
NAATCACCCNNNGGNCCTNTCGCATTGGGATTNCCAANANGCCAANGNAACCCCGNT 
COT^^IT^TAAAAAANT^mTTAAAGNAN^^^TT^m , NGN 
Ref 2.1

Sequence of BAC8 using primer HC3AS7, which spans nucleotides 386-405 of the cDNA. Exon
sequence is underlined and represents nucleotides 381-385. 

TC7TT NNGAAAAAGATTANATTATTAATTCTATGATATATTAACAATACACANCTCTAA 

CACTTGGACTATTTTTAAAATATGGCATGTAATTrAATAGATGACTGAAATATTT^ 

TTCTCAAATATTTTTTAAAGTTCCCTACAATGTTTTGTATTrGCTTAA 

AAAACCACCATATTACTTTCAGAAAATTATGCTAGCTAACAATAGGACAAAAAATTCT 

GTGTATGTC A ACA AA AAAAATTC AACCTT AAA' 1 ' 1 " 1 " 1 " 1 " 1 ' I ' 1 ' l'l ' 1 CCATAAA AAAC AGG GC 

TACTTGCCCAGGTGAGANGTGCTGCCGTATGAGCTCCTCGNTAGATTGCGCNGCCGGA 

NrTGTCGGNCCCTNCGTTTAATATAACGGCGTGNGCOTGTACCGCAGGCTNTGCTAGGT 

CGTGNTTCCCAAGATATChnT^nT^TANCATANT^ 

TNATTOTNGCAATTGTNACAATCCTAGTOTGTACNTNANAGNTCNGCCNCTGTGANNT 
CGTTGTATAGTCNGNGGCNCGCTTGl^CTGAT^^ 
NCNCCCATCTTTNCNNTTNNNNNCCCCCNTTT^ATNNTTTNN^ 
NTNAANNNACCNCC 

Ref 2.2

Sequence of BAC9 using primer HC3AS7, which spans nucleotides 386-405 of the cDNA. Exon
sequence is not found within this sequence. Since the primer is directed against exon sequence, we
presume that the sequence derived from HC3AS7 is intron sequence. Additionally, this sequence
matches the intron sequence found in the previous sequence (BAC8 sequenced with HC3AS7).
I GCGCTNCCNNhrmMTTATCTTCTGAAAAGACT^ 
f-CACACTCTAACACTGGACTTN*^ 

Wagcttctcaaatatntttaangtccctacaatgtttgnatot 
fy aaaccccatattactttcagaaaaotatgctagct^ 

5 tgtatgcaac aaa aaaaaattcaaccttn a atc i ' 1 'ci* it 1 " i " i ti 'ccaatanaa aacagg 

O GCTACTCTGCCACAGGCTCKjAGTCA 

CTATGTGATTGCCTGCCTAAGCCTCNGAGTAGTAGGCTCAGGTGCCACTACATGCCAG 
TAATCTAAAATTTATAGAGACAGGGCTGCTGTGTGNCCAG<XrrGGCTAACTCCGGGCT 

AAGCGTTCTTGCCTNGCTCTAAATGTGGGATACAGNATGTATCATNCATCAGCCAAAA 

AGTTAATTAArmCCAGATNAhrTATTTGC^TCAAACrCTCCAATOTAGC^ 

ACCTGCTCKrTGGCTAGAOTATCCCGNOTGTTATGGATCATATTANGCNNT^ 

CGAATGGNATCTATTCCGGGAGACANATTACTATNGGATGANAGCANATNGCCCNNAT 

GCTTmTTGTAACGCT^ANhrTAAGAAChriTCTNGAC^ 

NNGCGAmTGATACTAAhTTTC^TGNT 

GGGAGAACCTACm'NCCCCm'ACNNATANNCTNCACCCCCTACTACTOTNNCNNTCNC 

TCTCTAhm , CTACTC<^CNTTAT^^ 

GCCNCNANACTTANCNTTNATNCACTCTNNCT 
Ref 3.1

Sequence of BAC8 using primer C3AS4, which spans nucleotides 737-756 of the cDNA. Exon
sequence is underlined and represents nucleotides 677-733. 

TAATGTACATArTTGCTCATGACTGCAGATGATTCGTAGAAAATCCAGCCTCAAGGACA 

CCA G A ACA CTGGG ATTCGGTA ATG AGT A AAG CTTTG AA G AC A CCTTGT AAGC AATGC A 

TAAGTAAGAGAACACCAATTGAATCTATTATTTCTTTAATACTAATACCAGAATGGCA 

AATTAGAATTAAAGAGATAGTACTTGGTATCCAGTTTGGGTTTTGTGGCTTAAGTAGCA 

GTATCACCTTTtTCCAGAGTTACTGCTAAAATTAAAAATmAAACTAT 

GTATAAACATATTTGACTAACCTAAAAGCCACATTCTTGTATTTCCAATATAG<]ATCAA 

TATTTCTACTTCTCATAAAACAGGGAAAACGTATATCACCAAAAATAACTTCTTATTAC 

TTCCTTCTTAAAAGAAATTATCAATTCTTTTTATAGCACTTTGTGCTTACCT 

AATTTGTCTGTTTTCTCAGCAACATCATAAGCTACTTGAGGAGACATACTATAAACTGA 

TTTAACAGCTTTAGTGTCCCTACAGCTTAGCTCAATGTTTGACAAATATAGGAGATCAA 

TGCTT AAA G G AAT A AA G GCC AG G A C AAGTT CTG GT A GCA AATA GTGCAT AAAA GGTTT 
TGGGGGAAAAGGOTAAAAATGGATACATATCGGGGTNGCAAGhrTTTTTCCATGTGGG 

GTGAGGTGCCCCATGCCTT 
Ref 3.2

Sequence of BAC9 using primer C3AS4, which spans nucleotides 737-756 of the cDNA. Exon
sequence is underlined and represents nucleotides 677-730.

"H TAACATAGTGrTCTGACTGCAGATGATTCGTAGAAAATCCAGCCTCAAGGACAC CAGA 
cSaCACTGGGATTCGGTAATGAGTAAAGCTTTGAAGACA CCTTGT AAGCAATGCATAAGT 

» AAG AGAACACCAATTG AATCT ATT A' 1 ' 1'1'Cl'l'l AATA CT AAT A CCAG AATGGC AAATT A 
S GAATTAAAGAGATAGTACTTGGTATCCAGTTTGGGTTTTGTGGCITAAGTAGCAGTATC 
P= ACCTTTTTCCAGAGTTACTGCTAAAATTAAAAATTTTAAACTATCAGG 
W AA CAT ATTTGACTAACCT AAAA GCCACATTCTTGTATTTCCAATATAGCATCAATATT^ 
0 CTACTTCTCATAAAACAGGGAAAACGTATOTCACCAAAAATAACTTCITATTAOT 

S ~ TCTTAAAAAGAAATTATCAATTCTT^ 

TTGNCTGNTITTTCTG\GCAAACATCATAAGCrACTTG 

GATTACAGCTTTTANGTGTCCCTACAGCTTAAC^ 

TCAATGGCTTTAAAGAATAAAAGANCAGGGACAAGTTNTGGGTNGCCATNAGNACAA 
TAAAGGTTTTNGGGGGAAAAGGGAAAAAATNGATTNCATKTCGNGGTTNGCAAGGTN 
TTTTCCATTGNCWGGNGGAGGGGCCCATGCCATAAmTTTAACC lTlCl'l 111 1NGAAG 

AAATT AAACNNTTAAAGGGGTN 
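Each primer was used on both BAC8 and BAC9, and the paired reads are expected to overlap. One way to confirm this despite OCR noise is to find the longest run of bases the two reads share. The sketch below uses Python's standard `difflib` for that; the two strings are the first lines of the Ref 3.1 and Ref 3.2 reads as printed, and the function name `longest_shared_run` is illustrative. This is a rough consistency check, not a sequence alignment.

```python
from difflib import SequenceMatcher


def longest_shared_run(read_a, read_b):
    """Return the longest contiguous stretch of bases present in both reads.

    Whitespace is removed and case is ignored before comparison.
    """
    a = "".join(read_a.split()).upper()
    b = "".join(read_b.split()).upper()
    # autojunk=False keeps difflib from discarding frequent characters,
    # which matters for a four-letter alphabet.
    matcher = SequenceMatcher(None, a, b, autojunk=False)
    match = matcher.find_longest_match(0, len(a), 0, len(b))
    return a[match.a:match.a + match.size]


# First line of each read, copied as printed (OCR damage included).
BAC8_REF_3_1 = "TAATGTACATArTTGCTCATGACTGCAGATGATTCGTAGAAAATCCAGCCTCAAGGACA"
BAC9_REF_3_2 = "TAACATAGTGrTCTGACTGCAGATGATTCGTAGAAAATCCAGCCTCAAGGACAC CAGA"

shared = longest_shared_run(BAC8_REF_3_1, BAC9_REF_3_2)
print(len(shared), shared)
```

A long shared run supports the reading that both BAC clones carry the same genomic segment at this primer site; short runs would more likely point to extraction damage or to a real difference between the clones.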
Ref4.1 

Sequence of BAC8 using primer HC3AS6, which spans nucleotides 924-944 of the cDNA. Exon 
sequence is underlined and represents nucleotides 802-917. 
CCAGTCTGCAATATGCTGTGCGAAGCCOATATCAACrTTGCATCTTTGTCTTGNCATTC
GAGAAATCAGACTTGTGGAAGTAGGAGACAGCTTA CAGCGTGCACAAGCTCTCAGCA 
GAGCATATACGAATGAATCTTTTCCAGGGAGTTATTTATATACTACCTGAGCAAGCCA 
CTTTAGCTTTGGGCAGGAACTTOTGGATGTTATAAGTAATACT^ 

AATTAATATTTACTTCTTTTACANTCTTCTC 111 - 1 CCTTATCTTAGCCTTTATCCCCTTGT 

GGAAAAGACACTATCAATGCTAGATNCTCCCAAGNCAGAGAATTATGCAGGTTTGGTC 

AGAGAATCGACACAGACATGTTTACAGATTCTTCTTGAAATACATATTGTGCACGAGT 

TTmACAOTATCTCAATTTAGATCTCAGACAGCATNTNGACTAGNGGGTCTAGGACAT 

AGATACATNTTTGNGAACTTCTATAGAANAACNT^^rGC^^TAAAAAGGAGCT^GTTNG 

ANANGAATN^CTGNGAAGGGCCCGATACGANAATrTGACTTCGGNGAAAATTNNNG 

GATTN^^^ACAAA^^ITCTAGGNGGCACCTTO 

AAAAAAGCCCTTCimTAGOTOTCCCNGAAATGGAAAAGTNCCAAOTTCCNAAAA 
ANGGGCTTTGTTNNCTTNCNANA 



Sequence of BAC9 using primer HC3AS6, which spans nucleotides 924-944 of the cDNA. Exon



^J GGATCTTTGTCTNGNCATCGAGAAATCAGACTCTGTGGAAGNAGGCAGACAAGACTAT 
]J ACAGCNTGCACANAGCATCTCAGCAGGCATATAAGAATG AANCTTTTCCAGGGAGTTA 
ijTTTATATACTACCTGAGCAAGNACTTCAACnTNGGCAGGAACTTGTGGATGbnTATAA 
t " ' GTATACTTATATGAATAANATNGAA ATTAATATTT AATTC 1 " 11" 1 ACT I <_TI CTCTTTTCC 
^TTATCTTAGCCTTTATCCCCTCGTGAAAAAGAGCACT 

W NCAGGAATTTATAGCAGGTTGGTCGAGAATCGACACGACATGTTTACAGANTCATCTT 
? " GAATACATNATTGTGCACGAGThTITTTACTCTATCTCAAAATATAGATCTCAGATGGTC 
3 TATNGA^ATGNGGTTCTAGGACATGATTACATTTTTNGGGAACTTCCATAGAATAAA 
3 COTOTACCTNAAAANANGAGCCTGTTNGAAATNGAATCTACTNCTAAAGGGCNAGTNC 
CANATTTTACTTCCGCGANATOTCNGGATGTTACAAGTCTAGGGGGNCTTTAGNACGT 
TNGAT>mTGANCGGAAAAAAGCCCTTCTANNGGTCNCCTAATGGAAGCGCCAATTCC 
NAANAAGGNCTGTGTTKTTNGACATTTACCNGNNCC 

T^NANCCNCANCNCNNNCCTATANNCCTATCNCTCNNCTNNNCTCNTCACTCTCNNC 
NCTOTCTTCCNrTTCTNCACTr^ 

^WOTCANNCTCCCTACN^^^NCNC^^ , ^m'ACCATCTNCNCCNNCCT 



Sequence of BAC8 using primer C3S6, which spans nucleotides 1127-1146 of the cDNA. Exon
sequence is underlined and represents nucleotides 1181-1269.



GTATGATCCGCCAGACCGCTGCCATGGCAATNGTAGGGACATCGGTCCCTCAACTAAG 
AAGGCCTGGCAGTTTCCTNCTCACGTCAACGGTCAAAACAATCCTTCrACAGAATTTTT 



Ref4.2 




Ref5.1 
TTTTCTNGAAAGACAAATATTTACTAGGATATGCCCTTAAATATATGAGATGATTGTAT

CAGCTGATGCAAAAGTGCTCAGTTTTATTTATGAAAATATTAAAGTTCCCAGAATATrA 

ACTGTCTTCTCCCAAACAGTTTTAAAAAATGATTACCTCAAGGTTTATGGGAAAAAGC 

CCCGTATTCTGCATTCAGAATTTGGAAAATTGCCTCATTATAGATAGCCATNTCTTTTTT 

TTNTTTTTTTTATNCTTC AAGTCTTA GG G N AC ATGTGC AG AACATGC AGGNT AGTT AC A 

TATGTATACATGTGCCATGTTGGTGTGCTGCACCCANNAACCCGCAATTTAACATTAGG 

T^^^ATCTCCAAATGCTATCC^^ITCACCCTTCCCCCATNCCACAACAAGGCCCCGGGCNT 

TGNGATGTTCCCCTTCCTGTGCCCACTGTGTOTCACATTNCCNCTTCCCNCCCTTANTN 

NNGTGCAGAACOTNGCCNGTNCKZCCTm'mTnTNNCCC 



Ref 5.2

Sequence of BAC9 using primer C3S6, which spans nucleotides 1127-1146 of the cDNA. Exon
sequence is underlined and represents nucleotides 1151-1269.

rcnctgatgattatgaaagtgagaggggaagtatgataagccagaccgttgccatggc 

faatcgcagggacatcggtccctcaactaacaaggcctggcagtttcctcctcacgtca 

■' acgg taaaaacaatcctcctacagaarr 1" l'l' 1' 1 t 1 ctagaaag acaaatatttactagg 

i | jtatgcccttaaatatatgagatgattgtatcagcttgatgcaaaaagtgctcaggttt 

^tttatgaaaatattaaagttccagaatatttaactgtc^ 

fi^tgatacctcaggtttatckjggaaaaaagccccgtatrctgtca 

;|\aaatttx3nctcattatagatagttcatm 

* agtttttaa ggggnaccatgttgcaccaaanattgcaggggttnggttaccattatgg 

hqtattnccattggtncccccaotgtttggggngttggctttgccacccccc^ 

j-"accnnccgotgcgaatttttaaaacaabrttttgggg<} 

fynggcmttttnccctttncccccctttcncccncccnkttcccaacnnan 

f^ecccg<}gtaotggck5gaatag^^ttcccccccttnccc™ 

c3nnnctccatttggn>rrgcaaahntcccccacccntnattgttggtggngaaa 

ccggggggtttggggg' 11 "1 ' 1 " l'l ' i " 1 ggtccccnttgccc aantaattntgcnttg aana 

aaaagaatggggttttccaaagcttttngtcnccatttg 

gttcnccitncc ana anggoccaatgtg aaacnncctttca' 1 attggggnt 

tnccnttatggn 

Ref6.1 

Sequence of BAC8 using primer C3S7, which spans nucleotides 1513-1532 of the cDNA. Exon
sequence is underlined and represents nucleotides 1535-1588. 

TTCTTGGGAGCATAGGTGCCAGGCAAGAAATGGTACGGCGAAGCCGAGGACAGCTCG 
GTACGTACACAATAGCTTCTCCTCCTGGTGAGAATTTCTT^ 

ATTGTAATGATCATTGTTGCTAGTCnTCAATGTCAATCCTATGCTTTTTAAAAAGTGT^ 
TAAGTGTAACTGTGAATTAACTTGAATAATCATTTCTCTGCAGTAATAAAAGTTAGAAT 
TCTGATTTAGGTGAGTCA'GCATACCGCCCCCCCCCCCGTTTTCTCTAGAAAGTCTTCTC
TAGAAAACGTTCTCTAGAAAGTCCTATCTAGAAAACTTTCTCTAGAAAGTCCTTATGTG 
ATT AATAG C ATCC ATCCTCCCTTTTTAAATA GA CTTTATTTTTGT AG AGCAGTTTTA AGT 
TCACAGCAAAAGTGAGCAAAGGTACAGAGATTTCCCATATACCCCTTAGTATGCGTAG 
CCTCCCCCATTATTAACATCCCCCATCAAGAGTAGTGCATTTGTTGTAACTGGTGAACC 
TACATTAACACATCATCACCCAGAGTCCG<^GTTTACATTAGGGATCATTCATATAACA 
TCTATTTTTACTTTTTTITTTTT A GTTG A G ACAAG ATTCTCGCTCTGTCACCCAAG CTGG 
AGTGCAGTCCGNGTGGATTGTNGGCTTACTGNCN 

Ref 6.2 

Sequence of BAC9 using primer C3S7, which spans nucleotides 1513-1532 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1547-1588. 

gtgccggcaagaaatggtacggcgaagccgaggacagctcggtacgtacacaatagc 

ttctcctcctggtgagaatttcttcaatttccttg 

tgctagtcttcaatgtcaatcctatgctttttaaaaagtgttttaagtgtaact 
fttaacttgaataatcatttctctgcagtaataaaa 
uc^gcataccgcccccccccccgttttctcragaaagtc^ 
^baaagtcctctctagaaaacittctct 

vctccctttttaaataagactttatttttgtag . 
fikgcaaagggcagagat^cccatataccccttagtatgcgtagot 
:|catccccatc^gagtaagngcatttgttgtaactggngaacctacaattnac^ 
Jtnaccccacagtcccgcaggtttacatttatgggatcattcccctaotaacacct 

UtTTTAC iTi'l'lTl Ml 1 1 1 i AGGTTG AG ACAAGAATTTTCGGC 1 I'd ' 1 GTGTACCCCCAA 
fUACTNGGTAGTAGNCNNACCGTCGNGNAATTTWGGGGTTTCOTNGNGNNCAKTT^ 

^cNNTCNNCTTNNCNCCNAAAGAANrirrrrriCACCcrrrrrri icccccnaananann 
iIanccttccccttggnggangctggggacttccncagnggngg 

Ref 7.1 

Sequence of BAC8 using primer C3S8, which spans nucleotides 1842-1860 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1861-1917. 

CAAAGTGCAGTTTATCTCAACACTGTTTT GCTCACA GAGAGCCTTGGTTTCAAAGG TAG 
GTTATTTTGTACCTGCAGTGTrGTCAG ACTTTG Tl'lTlTl TATTAACATTGTCT AAG ATC 
ATrTGACACATTCATTGGTTAATATATGTAGTAATATATTAATGAATATGTGTAGTTAA 
AATTTAAATAATAACCTAAGACCCTTAATTCl' 1 C 1" 1 " 1 GCCTCTCTA CTGCTGCCTGCCTT 
TTAGAATTTTTCATTTATTCGAATCACCTITAACCAGTTCTGGTTTGA 

TTGGATGTGGAGAAGGGCCTGAAATTAATAGCCAATCTTAAATATGGGGCTTCTCTTG 
TTTrCTCTTCACTTGGTTCTGTTTTTATAAAAAACTCAAT^ 

AA GC AATTCAA C CC A CTG AAATT ATTTTATG ATG AATG G AAAAG AAGGT ATGTGTTTG 
TTCAACTGCTTTAAATGTITACITCTTATATTTGTTTTC 
AAATTTTGAAGGTAGCrATTTCATTTTAATCATCCTAGAGGATGGAATGCANAGATGTT
GG ATG AA AAT AA CTTACGTATTATTTTGT AAT AA ATAT AA G AATTCATATATGGTTG AT 
TACCTAAGTGGTTTTATGCACATTCTGATAGAAAGCTTCACCAACAATCCCTTGNTNGA 
TATATTATTT 

Ref7.2 

Sequence of BAC9 using primer C3S8, which spans nucleotides 1842-1860 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1864-1917. 

AAGTCAGTTTATCTACAACACTGTTTTGCTACACAGAGAGCCTTGGTTTCAAAGG TAGG 
TTATTTTGTACCTGCAGTGTTGTCAGACTTTGTTTTTTTTA 

TTTGACACATTCATTGGTTAATATATGTAGTAATATATTAATGAATATGTGTAGTTAAA 

atttaaataataacctaagacccttaattcttctttck:ctctctact 

tagaatttttcatttattcgaatcacctttaaccagttctggtttgaaa 

tggatgtggagaagggcctgaaattaatagccaatcttaaatatggggcitctcttgn 

tttctcttcacttggttctgtttttataaaaaactcaatttata 

cagccattcacccactgamttattttatgatgaatggaaaagaaaggtatgtgtttgtt 

^CCTGCTTTAAAATGNGACmrNT^ 

^iATCCTTAAAAmTTNGAAAGGGANGCTAhTITTCATTTTT^ 

^JjATTGGGAAATGCNCAAGATTTITTGATTGAAAAAN 

lyjGNAATAAAATAATTAGNAATTCCOTATTATGhTTTNGAATTACCTAAAG 

CTGCCCCATTTCrOTGATATGNAAAGCCTTTCACCAACCAAATTCCCCbr^ 

^ ATTATTTTTTNANGGGCCTCOTNTTTGTGGGGOTGGAAG 

!<iGTCCCCNC 

Ref 8.1

Sequence of BAC8 using primer C3S10, which spans nucleotides 2412-2431 of the cDNA. Exon
sequence is underlined and represents nucleotides 2432-2523.

AGTGCTAGTCCTCAGCAGCACTTGTTGCTGAATATTTGAGCATGCTGGAGGACCGGAA 

ATATCtTCCTGTGGGATGTGTAACATTTCAGG TAGGAATCTTCCAGATGTACATTAAAT 

CAAGGTATATCTTTTTTTGGTTTTAGCTTTTCTCACTGGTGTTTAGAT^ 

AAGGAAAGCTTAAAGACTTAAGCCAATGCTTCACAAGGTGAATTAACATTTCACAGTG 

ATTGTCATTAATACATTTTTAAGGAGTACTTCTTGTTGATTCTCm 

CCTCTGAATTATCAGCACTATGCTTATTTATTCTCTTTGGCTTTACT 

GTTAGATACTTTAACATCTATGGAAATGTATTACTGATAATCAGAATTCAGTAGAAATT 

CTTAATTGGCTTTTACTTCACATAGGAGATATACCAACATTCTCTATTCCCrACATAAA 

ATATTAAGATTATTTTATGACTAATACCCATGACTCACAGATGAGTTTGCCCTCTAGTA 

GGGTCATAATTCTGACCCACTAGTTGAATTCTCTGCTTACCAAGAGNCAGGTATGCTTG 

C I" V V V l(J I 1 1'CAAAACCTGTTAAATAGTAGGhnTGGGGATATTOTAAAAATTAGGTAAAT 

GGTATATCTTCTGGTGGAAANCAGAAN 
Ref 8.2

Sequence of BAC9 using primer C3S10, which spans nucleotides 2412-2431 of the cDNA. Exon
sequence is underlined and represents nucleotides 2444-2523. 
CAGCAGCACTrGTTGCTGAATATTTG^ 

GGG ATGTGT A A C ATTTCAGG T AGG AATCTTCC AG ATGTA CATTA AATCAA GGT AT ATC 
' n ' 1 ' l ' l ' 1 ' 1 GGTTTT AGCTTTTCTCACTGGTGTTTAG A AGTTT AT AAGG AAAGCTT 

AAAGACTTAAGCCAATGCTTCACAAGGTGAATTAACATTTCACAGTGATTGTCATTAA 
TACATTTTTAAGGAGTACTTCTTGTTGATTCTCTTTCCACAGTTT 
TCAGCACTATGCTTATTTATTCTCTTTGTCTTTACTGCCTTGTAA 

AACATCTATGGAAATGTATTACTGATAATCAGAATTCAGTAGAAATTCTTAATTGGCTT 
mTACITCACATAGCAGATNTACCAACATT^ 

ATTITATGACTAATACCATGACTCACAGATTGAGTTTGCCCTCTANTAGGGTNCATAAT 
TTCTGACCCACrAGTTGAATTCTCTGCTTACCAAAAGTCANTTATGGCTT^ 
TCAAAACCCTG^m'AAT^AGGNACGGCTT^GGAGATAATITATAAAAAAT^ 
NAAA>rrGGNTTATTATTC>nTCCNNGGTTGAAAAAACCCAGGAATTGGC^ 

^^AAAAAGNTTATTCCNGG' l ' ri ' Cl ' l 1 NCGGNAAAAAAACCAAAAAATCTTNGAAATTGT 

HjTTTTTACCAAAAANGACCTCCNCNGGGAAAAAGCKj^ 

Ref 9.1

Sequence of BAC9 using primer C3S11, which spans nucleotides 2679-2698 of the cDNA. Exon
sequence is underlined and represents nucleotides 2711-2799.
jz TTCCTATTCATGAAGCTAATCGGGA^ 

Li TCAAGAAGCATTCAGCAAAATTGTTCATCAGG TAATGATTCCAATTTCTAGCTTCACTA 
W TAAAGGGAAAAAACTGTCTGAAAGCATTAATGTTGTTTTGCACTGATGTCAAACTAGA 
9 TCCCGTGAAATGACC1ATTTTAATC^GACTAQ\AATGAGCGGTC^AAATGATAGTTCAT 
° GGCCAAAGCAAAGCTCATTAACAATAAAAATGAATTCACCTAAAGTAAATGGTGATCA 

TCAT AAA CTTTCTGCATA GC I'l'l'l'l'l'l'l 1 1 1 CATTTTTGAATTATTAATTAAGCAAGTTTT 
TAAAAATTGTGATTTTCTGTTTCACAAGGNAAGATCATAAGTTGNGGAAT CTCA^ t 1 1 1 1 
AA AAATTG ATACCCTATTNCTTTTGCTGNGG AAAANTGG AA G mill AATATTTTC AA 
G GT l ' l ' I ' 1 l ' l ' 1 AAAATTNAAATGG ATTGTGGAAAACCTTTT AAATNAATTT AAAACCT AC 
CT AAA AT A N1 T llTl'l AATGGNCCNNGCCANCTGGAA CCNTTTTA 1111111 CCCCTAG 
GAATGGTTTTACCCAAATCCATTCCCTTTTGAATAATATTTTTCCCT 
AAA NTITITNITI 1 1 1 1 GGGNGGAAAAAATANTTGG AAAATTAAAAAAATGGGGGTGG 
GGCChn'AAATGGGGATTATTTTTAAATTTCCTAAAAAAGGGAh^ 
NAATCCTTTTTGGGNGGhnTCNATTTATTGGGGAATCCTNCNCrr ITT 1"NTNCNCCTTA 

AAAAANTTAGGGCCTNCCAAAATTTTAA^ 
CCC lTl'CT'l NGCCCGGTTGGTTT 
Ref 10.1

Sequence of BAC8 using primer C3S12, which spans nucleotides 2908-2927 of the cDNA. Exon 
sequence is underlined and represents nucleotides 2928-2941. 

TACAGATTGGAGGT GAATGCTGTGGTGGTTCATAAAATGTCATCTTTAGTTTGTATTC^ 

CTCTGATGATTAGACTTTCAGATCC^GATCTAATCATTTAGTAAGCCAGATCTTGCCAA 

ATAAACTACTCCGTTAGAGAATAAGGACTTTTAATAGTTACAATAATACTCTTTCAAAT 

CTTTTATGGC^GCAATAAAATAGTAATATTGTCTATTTTTTGAGACT 

ATTTTAGAAACCCCTGTATCCTTCAGAATTACTGCGACTTAACGGAGAAATATATAGT^ 

TAATCCCACATTTTGrTGAAAAAGACAAAGAATTA^GTAGTAGCTAATAATTGAACTA 

GAACCAGAACCCTAAGAAATTTCTGACCCAAGCATATTATCTCrTTGGCTTA^ 

CCAGGTGAGGTTTCTITAGAACGTAAAAGCCT^ 

TTAACCTTTATAATTTCTTAATTTTCACCATAAATC 

TAACTAGNATTTTCTGlSrrATAGGTATTCTTTCCAACCTTTCT 

AGTGTAGTGGATGGACCGGAAGCATTGGGGTTCACCTGGGAGAATGGTTGGNAATGC 
CAGAACCCTTAGACCCCACCCCAGCCCCTGTGAAA 

Ref 10.2

Sequence of BAC9 using primer C3S12, which spans nucleotides 2908-2927 of the cDNA. Exon
sequence is underlined and represents nucleotides 2930-2941.
. l£ cAGATAGTAGGT GAATG<TGTGGTGGTTCATA^ 

I' CTG ATGATT AG ACTTTCAGATCCAG ATCTAATCATTTAGTAAG 
H AAACTACTCCGTTAGAGAATAAGGACTTTC 

{J TTTATGGCAGCAATAAAATAGTAATATTGTCrATTTTTTGAGACTATTTTCAC^ 

2 TTTAGAAACCCCTGTATCCTTCAGAATTACTGCGACTTAACGGAGAAATATATAGTATA 
O ATCCCACATTTTGTTGAAAAAGACAAAGAATTAAGTAGTAGCTAATAATTTGAACTAG 
O AACCAGAACCCTAAGAAATITCTGACCCAAGCATATTATCTC^ 

CAGGNGACK3TATCITTAGAACGTNAAAGCCTGAAA^ 

TAACCCTTTATAANTIT^CnTAATTTTCAC^ 

CTNGGGNCTANACCTNAGGCAATrTTTCTGGGTCATAAGGGAAA' ITl'd'TlT 1'CCC AAA 
CCLT IT 1C 11 C 1 lATTTTGTTGGGNCACCTCCCAAAAGGTG^CNGTTGGGGNTTNGGG 
NCCCCGNGAAAG<jCCATlTGGGGGhrrrCACCCCCGKjGGGANGATTTGG 
TNGTCNNNAAAACCTTCCNNACCCCCTWCCCCCNGGCCCCCOTGNNNGGAAATCCAAA 
AGGATCTTGNCNATTTITITANCCAAANGANCNCC^ 

TTCCCCAANAAGANGTAAGGTTKITGGCCT 
Ref 11.1 

BAC9 sequenced with HC3AS3, which spans nucleotides 3376-3393 of the cDNA. Exon sequence
is underlined and represents nucleotides 3267-3289.
CTGANGTGTNGCAANGCCACTCCTGTGTCTTTTCTGCATGTCCTCAATAGCAACTTCAA

TCGGTGTrAAriATGATCT GAGTNANNGAGCATCTGTrANATCAGNGTACTGACTGAAA 

CTATTTAATGAACTTTATGTATAATCAACTGAAATTAGANAAAAAAAAGATCAATNGT 

AAACTTCATGTAACAATAAAATTCCAAACTTGGATTCTAAATGAANNAAAAANATCAA 

CCTTTAAAGAAAAGCTGGGGGTGAATAAGGGCTTAGAAAAGANGTANAAAATGANGA 

CTCAAAATGGTAAAGGGTCTAATATGNATGGATAAGGATGGACATATCTTCGGACTCT 

GAGTGGTGTACATGGCTTGATGATTGCTCACTATGTGTGNCATTATG<KTACCTCTCTT 

TAGGCATGCCTGTTAAKTAGGAAGCTGAACTANCAAAGNCTCTTNGATGTATNANTCC 

TGCCGCTNAAGAAGGGGNCGCOTGANNCAAATGATTTGCNATGTNTCTGCTATNATNG 

NAAGNGNTCCTNGAhnWHTCNGANA^ 

AGNGCITCTTG<L\CCTCCTGNGCATCTCCCGT^ 

ATAA^ANNNAG<XCACm7WCCT^^ 

NAGACAATTTTTTNNCGCCCCCTCCTNOT 

TCmTNCCCCCNCawrrCTTANCTTNCT 

Ref 12.1

Sequence of BAC9 using primer C3S15, which spans nucleotides 3750-3769 of the cDNA. Exon
sequence is underlined and represents nucleotides 3779-4118, and also represents the 3' terminus of
the transcriptional unit.

' frGTGAAAAGATCTATTGGAAAACAACATGGAATGGAATT 

ff kjAAGAATGCAGTGGCCAAGAAAATATCAAATGTAGATTGTTAACGCTTGAGAATCATG 
? GCTATGGTTTCT AATGTTCTGGTAACAAGCTGTTATC 1' V 1 ' 1 'AAG ACATTTTAATG ACTC 
^ AAAGGTACACTATACATTTACCATTATT^^ 
" CTTTAAGTTCGTATTrTTTAATTTATATTACCATTT^ 

M AAATGTAGTAATGCTTATTTTAAAGGTACTATTAAATATGTGAATGTTTACA CTAATTT 

2 TACCGAGTGGGACTTCAAAATTTTTATTATTGACAATGGCTGAGAACAATTNAAGGGT 
1 TTGACTCNAGAACTAhnTCCAAACCTAGCAGAATAAAAATCATAGATAGCCCCAAATT 

AATGAGTTTGGGNAACTGTNTCAAAG l'l'l'l'l" 1 1 1 CCATTTAC ATA CCCAAAAACA GG AA 
ATTTTAGAATTTGCCNGAACCTTTACCTTAAGANAAAAACCCTTTTGTG 
TAmWTAAAATTCCCGG<K}GGANrrAATCTTAATNACCCCGGGTGGGGCCANNCNCC 
COTTATAACTTTGGAATTTAAAAATTCN IT I'l , 14NTNCAACCCCAAACTGNANTNGGGT 
NhTTTITNAAGGAAAACCTTTCCACTNGGAAGT^ 

AAANNGGGGAANATTGGGAAGTCTTCCCCTTC>riTNGGGGGGNGNCCCAAAAAATTCT 
TAATAAAANCCCCGGGGCTCCCATTNTTA GN ATI " 1 ' 1 T IT ITrTGGCCCCACACTGTGTT 
NATTAAANCCCC^COTGGTAAAAATTTTTNNNGAAAA 
HC2A

KIAA AS GNLDKNARFSAI YRQDSNKLSNDDMLKLI^^ LGNLDI TIDWSSD 
FPNYVNSS YIPTKQFETCSKTPI TFEVEEFVPCI PKHTQPYTI YTNHLYVYPKYLKYDSQ



HC2A 
KIAA 
rat 
HC4
HC1
HC3
HC5



VLHHHQNPEFYDEIK 

KSFAKARNIA2 CIEFKDSDEEDS QPLKCI Y GRPGGPVFTRSAFAAVLHHHQNPEFYDEIK 



HC2A
KIAA
rat
HC4
HC1
HC3
HC5



lELPTQUJEKHHLLLT FFHVS CDNS SKGS TKKRDWETQVGYSWLPLLKDGRWT SEQK I 



HC2A 
KIAA 

rat 
HC4 
HC1 
HC3 
HC5 



PVSANLPS GYLGYQELGMGRHYGPE I KtfVDGGKPLLKIS THLVS TVYTQDQHLHNFFQYC 
FVSAN1J>SGYI^YQ£IX^GRHYGPEIKWVI^^ 



GPGPARS TVS ISL I SNSARV 



HC2A QKTESGAQAlrGNELVKYIJCSIJiAMEGHVMI AFLPTI 

KIAA QKTESGAQALGNEIATKYLKSIJIAM^ 

rat 

HC4 MEIQVLI RFLSVIMQLFWVLPNMI HEDDVPI SCPHV 

HC1 MS FLPIILNQLFKVLV- QNEEDEI TTTVTRV 

HC3 NRSRSLSNSNPDISGTPTSPDDEVRS 1 1 GSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 

HC5 ■ 
IIHWA^HEEGLESHIJ^YVKYAVKAEPYVASEyK^/HEELTKSMr
1 1 HWA DC HEE GLE SHJLRS YVKY A YKAE ? WAS E YK7VHEEL TKSM 7 



I LKPSAPFLTSN 
ILKPSADFLTSN 



LFH I VSKCHEEGLDS YLSS FI KYS FRPGKPSAPQAP L I HETLATMMI ALLKQSADFLA I N 

LPDIVAKCHEEQLDHSVQSYIKFVFKTR AC KE R P VHE DLAKNVT GLLK-SNDSP T VK 

TQAHDRSCNRMSSHTETSSFLQTLTGRL? TKKLFHEELALQW /VCSG—SVR E 



Cadherin



KLLRY SW F FFDVL I KSMAQHL I ENSKVKLL 
KLLKYS WFFFDVL I KSMAQHL I ENS KVKLI 


RNQF 
RNQF 


FPASYHHAAETWNMLMPHITQKFGD 
FP AS YHHAVE TWNMLMPH I TQK FRD 


KLLKYSWFFFEI IAKSMATYLLEENKIKL1 
HVLlii S W FF FAI I LKSMAQHLI DTNK I QL F 
SALOQAW FFFEU^/KSMVHHL Y FNDKLEAP 


HGQF 
RPQH 
RKSF 


FPKAYHHALHSLFLAIT- IVESQYAE 
FPESYQNELDNLVMVLSDHVIWKYKD 
FPERFMDDIAALVSTIASDIVSRFQK 



NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NY IS — C FAPGDFjKTL FE YKFE FL 

NPEASKNANHSLAVFIKRCFTFMDRGFVFKQLN NY1 S — CFAPGDPKTLFEYKFEFL 



IPKESRHVNYSLASFl^CCLTLMDRGFVFNLIN DYIS — G FS PKDPKVLAE YKFE FL 

ALEETRRATHSVARFL^CFT FMDRGCVTKMVN NY IS — MFSSGDLKTLCQYKFDFL J 

DTEMVERLNTSLAFFLNDLLSVMDRGFVFSLIKSCra 3il/3c2-- 



RWCNHEH Y IPLNLPM ? FGKGRI QR YQDLQL DYS LTTEF 

R WCNEZH Y I PLNLPM PTGKGRIQR YQDLQL DYSL7~'EF 



QT I CNHEHY IPLNLPM AFAXPKLQR VQDSNL EYSL5IZY 

QEVCQHEHFI PLCLP I RSAN I PDPLTPSES TQELHASDMPEYSV7NEF / 

RIICS^HYVTI^LPCSIJ,TPPASPSPSVSSAlfeQSSGFSTNVQDQKIANMFELS--VPF V* y*'<2- 
MNADTAPTS PC PS I S SQNSSSCSS FQDQKIASMFDRTSRVPA 

Cadherin 



EC motif 



CRNHFLVGL 
CRNHFLVGI 


LLRE 
LLRE 


VGTALQE FRE VRLIAI SVLKNLLIKKSFDDRYASFpHQARI AT 

VGTALQEFRE VRL I A I S VL KNLL I KHS FDDR YAS RS HQAR I A T 


CKHHFLVGL 
CRKHFLBGI 
RQQHYLAGL 
SSTS-SPGL 


LLRE 
LLRE 
VLTE 
LFTE 


T S I ALQDNYE 1 RYTAI S V I KNLL I KHAFDTRYQHKNQQAKIAQ 

VG FALQEDQD VRHLAIAVLKNLMAKHS FDDRYREPRKQAQ IAS 

IAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
LAAALDAEGEGI SEVQRKAVSAIHS LLSSHDLDPRCVKPEVKVKI AA 



L YLPLFGLL I ENVQR I NVRDVS PFPVNAG-MTVKDES LALPAVN PLVT PQKGSTLDNS LH 
LYLPLFGIXIENVQRINVRDVSPFPVNAG-MTVKD^ 



LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDE FPCG FTSP — AN — RG5LS 

LYMPLYQILLDNMPRIYLKDLYPFTVNTSN^SRDDLSTNGGFQSQTAIKHANSVDTS FS 

LYLPLIGI IMETVPQL YDFTETHNQRGRP I CIATDDYES E SG SMIS 

LYLPLYGI ILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 
7DKDTAYGSFQNG--
KDVLNSI^FSS— • 
QTVAMAIAG7SVPQ- 
QNVALAIAGNNFN- 



HG I KREDSRGSL I P-EGATGFPDQGN7GEN TRQS 

IAISTVNHADSRAS1ASLDSNPSTNEKSSEKTDNCEKIPRPL ^ 

LTRPGS FLLTSJrSGRQHT S 

LK7SG- 1 VLSSLPYKQYN 



HC2A 7LGNS\^T*CDKLDQSEiy^LL>;CFLYILK£MSDDALF7YW 

KIAA TLGNSVVRCDKLI>QS£1KSIXMCFLYI1J^ 

rat 

HC4 STRSSVSQY>^LIXiYEIP3Ll>4CyLYIVKMISEDTLLTYWN-KVSPQELINJLILL£VCL 

HC1 AL I GS TLRFDRLDQAE 7RSLLMC FLH IMKT I S YE 7L I AYWQ- RAPS PEVSDFFS I ifciVCL 

HC3 T FS AES S RS L L I CLL WVLKN- ADE TVLQKW F7DLS VLQLNRLLDLiJy LCV 

HC5 MLNAD77RNLM I C FL W IMKN- ADQS L I RKW I ADLPS T QLNR I LDLL F I C V 



HC2A HQPjQYMGKRY IARNQEGLG — PIVHDRKS QTLFVSRNRTCM4 

KIAA HQFQYMGKRYIAR : TGMM 

HC4 FHFRYMGKRNIARVHDAWLSKHFGIDRKS QTMPALRNRSGVM 

HC1 QNFRYLGKRN I IRKIAAAF — KFVQS T Q!iNG7LKGSNPS CQTSGL1AQWMHS TS RHEGHK 

HC3 SCFEYKGKKVFERMNSLT FK- -KSKEMRAK LEEAILG5 1 GARQEMV

HC5 LCFEYKGKQSSDKVSTQVLQ -- KSRDVKAR LEEALLRGEGARGEMM



;*« HC2A KARLQQL GSLPNS L7FNHSYGHSDADVLHQSLLEANIA7ZVC 

KIAA HARLQCL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

rat 

HC4 CARLQKL £ BLESS FTLNHSSTTTEADI FHQALLEGNTA7EV5 

H HCI 2HRSQTLPIIRGK XALSNPKL LQMLDNTMT^NSNEIDIVHHVDTEANIATEGC i i 

flj HC3 RRSRGQI^RSPSGSAFGSQEmJUIRKDMTHWR^ W j 

HCS RRRAPGNBRFP GLNEN1JWKKEQ7HWROAKEKLDK7KAELIX5EALISGNIA7EAH 

s HC2A L7AIJDTLSLFTIAF^CL1ADHGHNPU4KKV^^ 

: ; ;n KIAA LTAJJZTLSLttlJin&QIJJiS 

Hl rat KLSRGHS PI>IKKVFI)VYU:FLQKHQSEMAIJ<KVF7AIJ^ L I Y 

H HC4 LTVLDT I S FF7QC FK7H FIJJNMHNPLMKKVFD I HLAFLKNGQSEVSLKHVFAS LRAF I S 

W HC 1 L7 1 LDLVS LF7Q7HQRQLQQCDCQN5 1>1XRGFD7YMLFFQVNQSATALKHVFAS LRL FVC 

£3 HC3 LI I LDT LE I WQ7VS - -V7ES — KES I LGGVUCVLLHSMACNQSAVYLQHCFATQRAL VS 

[J HCS LI ILDMQENI IQASS — ALDC — KDSLLGGVIJIVLVNSLNCDQSTTYLTHCF^IJ^ALIA 

HC2A KFPS 7 FYEGRAIMCAALCYE I LKCCNSKLSS I RTEASQLL YFLMRNNFDYTGKKS FVR7H 

KIAA KFPSTFYEGRAI^ICAALCYEIJUKCCNSKLSS IRTEASQLLYFmRNNFDYTGKKSFVRTH 

rat KFPSTFYEGRAEMCASLC YEVLKCCNSKLSS I RTEASQLLYFLMRNNFDYTGKKSFVRTH

HC4 KFPSAFFKGRVNMCAAFCYEVLKCCTSKI SSTRNEASALLYUjHRNNFE YTKRKTFLR7H

HC1 KFPSAFFQGPADLCGS FCYEVIJCCCTHRSRSTQTEASALLYLniRKNFEFNKQKS I VRSH

HC3 i^ELLFEEETEQCPiDl^lJUXR^ — NFARVK 7, ' £L 

HC5 KFGDLLFEEEVEQC FDLCHQVLHHCSS SMDVTRSQACATLYLIiiRFS FGATS -- NFARVK

HC2A LQVI ISVSQLIADWGIGE7RFQQSLS I INNGANSDRLIKHTSFSSDVKDLTKRIRTVLM 

KIAA LQVI I S VS QLIADWGI GGTRFQQSLS I INN CANSDRLIKHTS FSSDVKDLTKRIRTVLM

rat LQVIISLSQLIADWGIGG7RFQQSI^IINNCANSDRLIKHTSFSSDVKDLTKRIR7VI>1 

HC4 LQI I IAVSQLIADVAL5GGSRFQESLFI I NN FANS DRPHLARAFPAE VKDI/TKR I R7VLM 

HC1 LqL IKAVSQL IAD- AG I GGS RFQHSLAI 7NN FANGDKQKKl^N FPAEVKDL7 KR I R7VLM

HC3 MC^MSLSSLVG7SQNF*TEEFI^RSI^7ILTYAEED^ 

HC5 MQV7MSlA5LVGRAPDFNEEHlilRSIJ<7IIAYSEEDTAMQMTPFPTQVEELLCNINS I LY
1021/341

TTT CTT GTA ATA AAG CTA GAA AAA GTC CTA 
phe leu val ile lys leu glu lys val leu 
1081/361 

CCA TAT ATG ATT TTC AAA GAA GCA GAT GCC 
pro tyr met ile phe lys glu ala asp ala 
1141/381 

AAG AGT CAA GCA GAT CAG TTT TGC CAA AGA 
lys ser gln ala asp gln phe cys gln arg
1201/401 

ACT GCA ATC CAT TTA ATG AAT ATT GTT AGC 
thr ala ile his leu met asn ile val ser 
1261/421 

GAA GTA GAA ATC AGT ACT GGA GAA CGA AAA 
glu val glu ile ser thr gly glu arg lys 
1321/441 

ATT GTT GGC AGA CGA TCA CTT GAA AGG ACA 
ile val gly arg arg ser leu glu arg thr 
1381/461 

AGC TTT CGA CCA GCT ACT CTC ACA GTG ACA 
ser phe arg pro ala thr leu thr val thr 
1441/481 

AGT GAT GAA GAT CTC TAC AAA TTC CTT GCT 

ser asp glu asp leu tyr lys phe leu ala
1501/501

CGA CTA AGA CCT ATT ACA GCT CAG CTC AAG
arg leu arg pro ile thr ala gln leu lys
1561/521

CAT TAT TGC CTA ACT CCG GAG CTG CTT CAA
his tyr cys leu thr pro glu leu leu gln
1621/541

CCT ACC AGA GAA ATC TTA GAG TTT CCC GCA
pro thr arg glu ile leu glu phe pro ala
1681/561

AGA AAT CTT CTC TAC ATA TAC CCT CAG AGT
arg asn leu leu tyr ile tyr pro gln ser
1741/581

AGA AAT ATA ACA GTG AAA GTC CAG TTT ATG
arg asn ile thr val lys val gln phe met
1801/601

GTA ATC TTT GGT AAA TCT AGC TGT TCA GAA 
val ile phe gly lys ser ser cys ser glu 
1861/621 

TAT CAT AAC AGG TCT CCT GAT TTT CAT GAA 
tyr his asn arg ser pro asp phe his glu 
1921/641 

ACT GAC CAT CAT CAC TTG CTT TTT ACT TTT 
thr asp his his his leu leu phe thr phe 
1981/661 

ACT CCT CTT GAA ACA CCA GTT GGA TAT ACA 
thr pro leu glu thr pro val gly tyr thr 
2041/681 

AAG ACT GGC CAG TTT TGC TTG CCA GTC TCA 
lys thr gly gin phe cys leu pro val ser 
2101/701 

CTG TCT CCT GAG GTT CCT CTA CCT GGC ATG 
leu ser pro glu val pro leu pro gly met 



1051/351 

CAG CAA GGA GAC ATT GGA GAG TGT GCA GAA 
gln gln gly asp ile gly glu cys ala glu
1111/371 

ACC AAG AAT AAA GAA AAA CTG GAG AAA CTG 
thr lys asn lys glu lys leu glu lys leu 
1171/391 

CTT GGG AAA TAT CGC ATG CCT TTT GCT TGG 
leu gly lys tyr arg met pro phe ala trp 
1231/411 

AGT GCT GGG AGT TTG GAA AGA GAT TCT ACA 
ser ala gly ser leu glu arg asp ser thr 
1291/431 

GGG TCT TGG TCA GAG AGG AGG AAT TCT AGT 
gly ser trp ser glu arg arg asn ser ser 
1351/451 

ACA AGT GGA GAT GAT GCT TGT AAC TTG ACG 
thr ser gly asp asp ala cys asn leu thr 
1411/471 

AAT TTT TTT AAG CAG GAA GGA GAC CGC TTA 
asn phe phe lys gln glu gly asp arg leu
1471/491 

GAT ATG AGA AGG CCA TCT TCT GTC TTA CGG 
asp met arg arg pro ser ser val leu arg 
1531/511 

ATA GAC ATT TCT CCC GCA CCT GAA AAT CCC 
ile asp ile ser pro ala pro glu asn pro 
1591/531 

GTG AAG CTT TAC CCT GAC AGT AGA GTT AGA 
val lys leu tyr pro asp ser arg val arg 
1651/551 

AGG GAT GTT TAT GTT CCA AAC ACT ACT TAC 
arg asp val tyr val pro asn thr thr tyr 
1711/571 

CTT AAT TTT GCC AAT CGT CAA GGT TCT GCT 
leu asn phe ala asn arg gln gly ser ala
1771/591 

TAT GGA GAG GAT CCA AGC AAT GCC ATG CCG 

tyr gly glu asp pro ser asn ala met pro 

1831/611 

TTT TCA AAG GAA GCC TAT ACA GCC GTA GTA 
phe ser lys glu ala tyr thr ala val val 
1891/631 

GAA ATC AAG GTT AAG CTT CCT GCT ACT TTA 
glu ile lys val lys leu pro ala thr leu 
1951/651 

TAT CAT GTT AGT TGT CAA CAA AAA CAA AAT 
tyr his val ser cys gln gln lys gln asn
2011/671 

TGG ATA CCA ATG CTT CAG AAT GGA CGG TTG 
trp ile pro met leu gln asn gly arg leu
2071/691 

TTG GAA AAA CCA CCA CAG GCT TAT TCT GTA 
leu glu lys pro pro gln ala tyr ser val
2131/711 

AAA TGG GTA GAT AAT CAC AAA GGT GTT TTT 
lys trp val asp asn his lys gly val phe 
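The paired position labels in this listing (for example 1021/341 or 2131/711) give the nucleotide number of the first base of a codon followed by the number of the amino acid residue it encodes. A minimal sketch of the arithmetic follows, assuming numbering starts at nucleotide 1 for the first base of codon 1, which is consistent with every label shown; the function names are illustrative.

```python
def residue_for_nucleotide(nucleotide):
    """Return the 1-based residue number whose codon contains this nucleotide."""
    if nucleotide < 1:
        raise ValueError("nucleotide positions are 1-based")
    return (nucleotide - 1) // 3 + 1


def position_label(nucleotide):
    """Format a position the way the listing does: nucleotide/residue."""
    return f"{nucleotide}/{residue_for_nucleotide(nucleotide)}"


for start in (1021, 1051, 2131):
    print(position_label(start))
```

Because each printed half-row holds ten codons, consecutive labels advance by 30 nucleotides and 10 residues, which is a convenient check when a label was damaged in extraction.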
2161/721 2191/731

AAT GTT GAA GTT GTT GCT GTT TCG TCT ATC CAT ACA CAA GAT CCT TAT CTT GAC AAA TTT 
asn val glu val val ala val ser ser ile his thr gln asp pro tyr leu asp lys phe
2221/741 2251/751 

TTT GCT CTG GTC AAT GCT CTG GAT GAA CAC CTG TTC CCA GTC CGA ATT GGG GAC ATG CGA 
phe ala leu val asn ala leu asp glu his leu phe pro val arg ile gly asp met arg 
2281/761 2311/771 

ATC ATG GAA AAT AAC TTA GAA AAT GAA TTG AAG AGC AGT ATT TCA GCA CTG AAT TCA TCC 
ile met glu asn asn leu glu asn glu leu lys ser ser ile ser ala leu asn ser ser 
2341/781 2371/791 

CAG CTG GAA CCA GTG GTC CGA TTT CTT CAT CTT CTG CTA GAT AAA CTG ATA CTT TTA GTT 
gln leu glu pro val val arg phe leu his leu leu leu asp lys leu ile leu leu val
2401/801 2431/811 

ATT AGA CCT CCT GTC ATT GCT GGC CAA ATA GTT AAC CTA GGT CAA GCA TCT TTT GAA GCC 
ile arg pro pro val ile ala gly gln ile val asn leu gly gln ala ser phe glu ala
2461/821 2491/831 

ATG GCA TCA ATT ATA AAT CGA CTT CAC AAA AAC TTG GAA GGA AAT CAT GAC CAG CAT GGC 
met ala ser ile ile asn arg leu his lys asn leu glu gly asn his asp gln his gly
2521/841 2551/851 

AGA AAC AGC CTT CTT GCA TCA TAT ATT CAT TAT GTT TTC CGC CTA CCA AAT ACT TAC CCT 
arg asn ser leu leu ala ser tyr ile his tyr val phe arg leu pro asn thr tyr pro 
2581/861 2611/871 

AAT TCA TCA TCA CCA GGT CCT GGG GGT TTG GGA GGA TCA GTG CAT TAT GCC ACA ATG GCT
asn ser ser ser pro gly pro gly gly leu gly gly ser val his tyr ala thr met ala
2641/881 2671/891

AGA TCT GCG GTG AGA CCT GCA AGC CTT AAT TTA AAT CGT TCT CGA AGC CTT AGT AAT AGC
arg ser ala val arg pro ala ser leu asn leu asn arg ser arg ser leu ser asn ser
2701/901 2731/911

AAT CCA GAT ATA TCT GGG ACT CCC ACG TCA CCA GAT GAT GAA GTT CGA TCA ATC ATC GGG
asn pro asp ile ser gly thr pro thr ser pro asp asp glu val arg ser ile ile gly
2761/921 2791/931

AGT AAG GGT TTA GAT CGC TCC AAT TCC TGG GTT AAC ACT GGT GGT CCA AAA GCT GCC CCA
ser lys gly leu asp arg ser asn ser trp val asn thr gly gly pro lys ala ala pro
2821/941 2851/951

TGG GGA TCC AAC CCC AGT CCA AGT GCA GAA TCA ACA CAG GCT ATG GAT CGA AGT TGT AAT
trp gly ser asn pro ser pro ser ala glu ser thr gln ala met asp arg ser cys asn
2881/961 2911/971

CGT ATG TCT TCG CAC ACA GAG ACG TCA AGT TTC TTA CAA ACA TTA ACG GGA CGC TTA CCA
arg met ser ser his thr glu thr ser ser phe leu gln thr leu thr gly arg leu pro
2941/981 2971/991

ACT AAA AAG CTT TTT CAC GAG GAG CTG GCT TTG CAG TGG GTT GTT TGC AGT GGC AGC GTT 
thr lys lys leu phe his glu glu leu ala leu gln trp val val cys ser gly ser val 
3001/1001 3031/1011 

CGG GAA TCA GCT TTG CAA CAA GCC TGG TTC TTT TTT GAA TTA ATG GTA AAG AGC ATG GTG 
arg glu ser ala leu gln gln ala trp phe phe phe glu leu met val lys ser met val 
3061/1021 3091/1031 

CAC CAT TTA TAC TTT AAT GAT AAA CTT GAG GCT CCA AGG AAA AGT CGT TTT CCA GAA CGT 
his his leu tyr phe asn asp lys leu glu ala pro arg lys ser arg phe pro glu arg 
3121/1041 3151/1051 

TTC ATG GAT GAC ATT GCA GCT CTT GTC AGC ACG ATT GCT AGT GAT ATA GTT TCA CGA TTT 
phe met asp asp ile ala ala leu val ser thr ile ala ser asp ile val ser arg phe 
3181/1061 3211/1071 

CAG AAG GAC ACA GAA ATG GTT GAG AGA CTC AAT ACA AGC CTT GCA TTC TTT CTC AAT GAT 
gln lys asp thr glu met val glu arg leu asn thr ser leu ala phe phe leu asn asp 
3241/1081 3271/1091 

CTG TTG TCT GTT ATG GAC AGA GGA TTT GTT TTT AGC CTT ATA AAG TCC TGC TAT AAA CAG 
leu leu ser val met asp arg gly phe val phe ser leu ile lys ser cys tyr lys gln 
3301/1101 3331/1111 

GTG TCT TCA AAG CTT TAC TCA TTA CCG AAT CCC AGT GTT CTG GTG TCC TTG AGG CTG GAT 
val ser ser lys leu tyr ser leu pro asn pro ser val leu val ser leu arg leu asp 
3361/1121 3391/1131 

TTT CTA CGA ATC ATC TGC AGT CAT GAG CAC TAT GTT ACA TTA AAC TTA CCC TGC AGC TTA 
phe leu arg ile ile cys ser his glu his tyr val thr leu asn leu pro cys ser leu 
3421/1141 3451/1151 

CTT ACT CCA CCT GCA TCT CCA TCA CCT TCT GTT TCT TCT GCA ACA TCT CAG AGT TCT GGA 
leu thr pro pro ala ser pro ser pro ser val ser ser ala thr ser gln ser ser gly 
3481/1161 3511/1171 

TTT TCT ACG AAT GTA CAA GAC CAA AAG ATT GCA AAT ATG TTT GAA TTA TCC GTG CCT TTC 
phe ser thr asn val gln asp gln lys ile ala asn met phe glu leu ser val pro phe 
3541/1181 3571/1191 

CGC CAA CAG CAT TAT TTG GCA GGA CTT GTG TTA ACA GAG CTG GCT GTC ATT TTA GAC CCT 
arg gln gln his tyr leu ala gly leu val leu thr glu leu ala val ile leu asp pro 
3601/1201 3631/1211 

GAT GCT GAA GGA CTG TTT GGA TTG CAT AAG AAA GTC ATC AAT ATG GTA CAC AAT TTA CTC 
asp ala glu gly leu phe gly leu his lys lys val ile asn met val his asn leu leu 
3661/1221 3691/1231 

TCC AGT CAC GAC TCA GAC CCG CGG TAC TCT GAC CCT CAG ATA AAG GCT CGA GTG GCC ATG 
ser ser his asp ser asp pro arg tyr ser asp pro gln ile lys ala arg val ala met 
3721/1241 3751/1251 

TTG TAT CTA CCT CTG ATT GGT ATT ATC ATG GAA ACT GTA CCT CAG CTG TAT GAT TTT ACA 
leu tyr leu pro leu ile gly ile ile met glu thr val pro gln leu tyr asp phe thr 
3781/1261 3811/1271 

GAA ACT CAC AAT CAA CGA GGA AGA CCA ATT TGT ATA GCC ACT GAT GAT TAT GAA AGT GAG 
glu thr his asn gln arg gly arg pro ile cys ile ala thr asp asp tyr glu ser glu 
3841/1281 3871/1291 

AGC GGA AGT ATG ATA AGC CAG ACC GTT GCC ATG GCA ATC GCA GGG ACA TCG GTC CCT CAA 
ser gly ser met ile ser gln thr val ala met ala ile ala gly thr ser val pro gln 
3901/1301 3931/1311 

CTA ACA AGG CCT GGC AGT TTC CTC CTC ACG TCA ACG AGT GGC AGG CAA CAC ACT ACC TTT 
leu thr arg pro gly ser phe leu leu thr ser thr ser gly arg gln his thr thr phe 
3961/1321 3991/1331 

TCA GCA GAA TCA AGT CGA AGC CTT TTG ATC TGT CTA CTT TGG GTT CTC AAA AAT GCA GAT 
ser ala glu ser ser arg ser leu leu ile cys leu leu trp val leu lys asn ala asp 
4021/1341 4051/1351 

GAA ACA GTT CTA CAG AAG TGG TTT ACA GAT CTC TCA GTC TTG CAG CTA AAC CGG CTA TTA 
glu thr val leu gln lys trp phe thr asp leu ser val leu gln leu asn arg leu leu 
4081/1361 4111/1371 

GAT CTG CTT TAT CTC TGT GTG TCT TGC TTT GAG TAT AAA GGG AAA AAA GTG TTT GAA CGA 
asp leu leu tyr leu cys val ser cys phe glu tyr lys gly lys lys val phe glu arg 
4141/1381 4171/1391 

ATG AAT AGC TTG ACC TTT AAG AAA TCA AAA GAC ATG AGA GCA AAG CTT GAA GAA GCT ATT 
met asn ser leu thr phe lys lys ser lys asp met arg ala lys leu glu glu ala ile 
4201/1401 4231/1411 

CTT GGG AGC ATA GGT GCC AGG CAA GAA ATG GTA CGG CGA AGC CGA GGA CAG CTC GAG AGA 
leu gly ser ile gly ala arg gln glu met val arg arg ser arg gly gln leu glu arg 
4261/1421 4291/1431 

AGC CCA TCT GGA AGT GCC TTT GGA AGT CAA GAA AAT TTG AGG TGG AGG AAA GAT ATG ACT 
ser pro ser gly ser ala phe gly ser gln glu asn leu arg trp arg lys asp met thr 
4321/1441 4351/1451 

CAC TGG CGT CAA AAC ACA GAG AAG CTT GAC AAA TCA AGA GCA GAG ATT GAA CAC GAA GCA 
his trp arg gln asn thr glu lys leu asp lys ser arg ala glu ile glu his glu ala 
4381/1461 4411/1471 

CTG ATT GAT GGA AAC CTG GCT ACA GAA GCA AAC CTA ATC ATT TTA GAT ACA TTA GAG ATT 
leu ile asp gly asn leu ala thr glu ala asn leu ile ile leu asp thr leu glu ile 



FIG. 6 (4 of 6) 

4441/1481 4471/1491 

GTT GTT CAG ACC GTT TCT GTA ACG GAA TCC AAA GAG AGC ATT CTT GGT GGA GTG CTA AAA 
val val gln thr val ser val thr glu ser lys glu ser ile leu gly gly val leu lys 
4501/1501 4531/1511 

GTG CTA CTA CAC AGC ATG GCC TGT AAC CAA AGT GCA GTT TAT CTA CAA CAC TGT TTT GCT 
val leu leu his ser met ala cys asn gln ser ala val tyr leu gln his cys phe ala 
4561/1521 4591/1531 

ACA CAG AGA GCC TTG GTT TCA AAG TTT CCT GAA CTC TTA TTT GAA GAA GAG ACA GAG CAG 
thr gln arg ala leu val ser lys phe pro glu leu leu phe glu glu glu thr glu gln 
4621/1541 4651/1551 

TGT GCT GAT TTA TGC CTC AGG CTT CTC CGA CAC TGT AGC AGT AGC ATC GGT ACA ATA CGG 
cys ala asp leu cys leu arg leu leu arg his cys ser ser ser ile gly thr ile arg 
4681/1561 4711/1571 

TCA CAC CCC AGT GCC TCC CTT TAC CTA CTA ATG AGG CAA AAC TTT GAG ATT GGG AAT AAC 
ser his pro ser ala ser leu tyr leu leu met arg gln asn phe glu ile gly asn asn 
4741/1581 4771/1591 

TTT GCC AGG GTT AAA ATG CAG GTA CCA ATG TCA CTA TCC TCC TTG GTG GGC ACA TCT CAG 
phe ala arg val lys met gln val pro met ser leu ser ser leu val gly thr ser gln 
4801/1601 4831/1611 

AAT TTT AAT GAA GAA TTC TTA AGA CGT TCT CTA AAG ACT ATA TTG ACA TAT GCT GAA GAA 
asn phe asn glu glu phe leu arg arg ser leu lys thr ile leu thr tyr ala glu glu 
4861/1621 4891/1631 

GAT CTG GAA TTG AGG GAA ACA ACA TTT CCT GAT CAG GTC CAG GAT CTG GTT TTC AAT CTC 
asp leu glu leu arg glu thr thr phe pro asp gln val gln asp leu val phe asn leu 
4921/1641 4951/1651 

CAT ATG ATT CTT TCT GAT ACT GTG AAA ATG AAG GAA CAC CAG GAG GAT CCT GAA ATG TTG 
his met ile leu ser asp thr val lys met lys glu his gln glu asp pro glu met leu 
4981/1661 5011/1671 

ATT GAT CTA ATG TAC AGA ATT GCC AAG GGT TAC CAG ACC TCT CCA GAG CGA TTG ACC TGG 
ile asp leu met tyr arg ile ala lys gly tyr gln thr ser pro glu arg leu thr trp 
5041/1681 5071/1691 

TTG CAG AAC ATG GCA GGC AAG CAC TCA GAA CGA AGC AAT CAT GCT GAA GCT GCA CAG TGT 
leu gln asn met ala gly lys his ser glu arg ser asn his ala glu ala ala gln cys 
5101/1701 5131/1711 

CTA GTC CAC TCA GCA GCA CTT GTT GCT GAA TAT TTG AGC ATG CTG GAG GAC CGG AAA TAT 
leu val his ser ala ala leu val ala glu tyr leu ser met leu glu asp arg lys tyr 
5161/1721 5191/1731 

CTT CCT GTG GGA TGT GTA ACA TTT CAG AAT ATT TCA TCT AAT GTT TTA GAA GAA TCT GCG 
leu pro val gly cys val thr phe gln asn ile ser ser asn val leu glu glu ser ala 
5221/1741 5251/1751 

GTC TCA GAT GAT GTG GTA TCT CCA GAT GAA GAA GGT ATC TGC TCT GGA AAA TAC TTT ACT 
val ser asp asp val val ser pro asp glu glu gly ile cys ser gly lys tyr phe thr 
5281/1761 5311/1771 

GAG TCA GGA CTT GTG GGA TTA CTG GAA CAA GCA GCT GCT TCC TTC TCT ATG GCT GGC ATG 
glu ser gly leu val gly leu leu glu gln ala ala ala ser phe ser met ala gly met 
5341/1781 5371/1791 

TAT GAA GCA GTT AAT GAA GTT TAC AAA GTA CTT ATT CCT ATT CAT GAA GCT AAT CGG GAT 
tyr glu ala val asn glu val tyr lys val leu ile pro ile his glu ala asn arg asp 
5401/1801 5431/1811 

GCA AAG AAA CTA TCC ACA ATT CAT GGT AAA CTT CAA GAA GCA TTC AGC AAA ATT GTT CAT 
ala lys lys leu ser thr ile his gly lys leu gln glu ala phe ser lys ile val his 
5461/1821 5491/1831 

CAG AGT ACT GGC TGG GAG CGG ATG TTT GGC ACC TAT TTT CGT GTT GGT TTT TAT GGA ACC 
gln ser thr gly trp glu arg met phe gly thr tyr phe arg val gly phe tyr gly thr 
5521/1841 5551/1851 

AAG TTC GGG GAT TTG GAT GAA CAA GAA TTT GTT TAC AAG GAG CCT GCA ATA ACC AAA CTT 
lys phe gly asp leu asp glu gln glu phe val tyr lys glu pro ala ile thr lys leu 
5581/1861 5611/1871 

GCA GAG ATA TCT CAC AGA TTG GAG GGA TTT TAC GGA GAA AGA TTT GGA GAG GAT GTG GTT 
ala glu ile ser his arg leu glu gly phe tyr gly glu arg phe gly glu asp val val 
5641/1881 5671/1891 

GAA GTA ATC AAA GAC TCT AAT CCT GTA GAC AAG TGT AAA TTA GAT CCT AAC AAG GCA TAT 
glu val ile lys asp ser asn pro val asp lys cys lys leu asp pro asn lys ala tyr 
5701/1901 5731/1911 

ATT CAG ATT ACC TAT GTG GAG CCA TAC TTT GAC ACA TAT GAG ATG AAG GAC AGA ATC ACC 
ile gln ile thr tyr val glu pro tyr phe asp thr tyr glu met lys asp arg ile thr 
5761/1921 5791/1931 

TAT TTC GAC AAA AAT TAC AAT CTT CGT CGA TTC ATG TAC TGT ACA CCC TTT ACT TTA GAT 
tyr phe asp lys asn tyr asn leu arg arg phe met tyr cys thr pro phe thr leu asp 
5821/1941 5851/1951 

GGC CGT GCC CAT GGG GAA CTT CAT GAA CAA TTC AAA AGG AAG ACC ATT CTG ACT ACG TCT 
gly arg ala his gly glu leu his glu gln phe lys arg lys thr ile leu thr thr ser 
5881/1961 5911/1971 

CAT GCC TTT CCT TAT ATT AAA ACA AGG GTC AAT GTC ACT CAT AAA GAA GAG ATC ATC TTA 
his ala phe pro tyr ile lys thr arg val asn val thr his lys glu glu ile ile leu 
5941/1981 5971/1991 

ACA CCA ATT GAA GTT GCT ATT GAG GAC ATG CAG AAA AAG ACA CAG GAG TTG GCA TTT GCA 
thr pro ile glu val ala ile glu asp met gln lys lys thr gln glu leu ala phe ala 
6001/2001 6031/2011 

ACA CAT CAG GAT CCC GCA GAC CCC AAA ATG CTT CAG ATG GTA CTC CAG GGA TCT GTA GGC 
thr his gln asp pro ala asp pro lys met leu gln met val leu gln gly ser val gly 
6061/2021 6091/2031 

ACC ACA GTG AAT CAG GGG CCT TTG GAA GTT GCC CAG GTT TTT CTG TCT GAA ATA CCT AGT 
thr thr val asn gln gly pro leu glu val ala gln val phe leu ser glu ile pro ser 
6121/2041 6151/2051 

GAC CCA AAG CTC TTC AGA CAT CAT AAT AAA CTG CGA CTC TGC TTT AAA GAT TTT ACT AAA 
asp pro lys leu phe arg his his asn lys leu arg leu cys phe lys asp phe thr lys 
6181/2061 6211/2071 

AGG TGT GAA GAT GCC TTA AGA AAA AAT AAG AGC TTA ATT GGG CCG GTT CAA AAG GAG TAT 
arg cys glu asp ala leu arg lys asn lys ser leu ile gly pro val gln lys glu tyr 
6241/2081 6271/2091 

CAA AGG GAA TTG GGG AAA CTA TCT TCG CCT TAA 
gln arg glu leu gly lys leu ser ser pro OCH 

AGAGGCCCTACAGCCCTAGATCACAGAAAGTCCCTCAGTTATCCAAGCCAGTATTGCTTGTCCCCTGCCACAGAGATTCC 
TTGAGTCGAATGAGCTTTCGCAAAATGGATCTCTAAACTGAATGCACTTGTTTT 

AACATCGAGTGTGAAAAGATCTATTGGAAACCAACATGGAATGGAATTCTGGAAATTATTATTCATTGAAGAATGCAGTG 

GCCAAGAAAATATCAAATGTAGATTGTTAACGCTTGAGAATCATGGCTAT 

TCTTTTAAGACATTTTAATGACTCAAAGGTACACTATACATTTACCATTATTTATACCATAGCTAAGGTTAAAAATTTAT 
TCACTTTAAGTTCGTATTTTTTAATTTAT/^TCACCATTTATAGATTCATTTTGGACCCATTTTAAATGTAGTAATGCTTA 
TTTTAAAGGTACTAAAAAATATGTG^TGTTTACCTCCT 
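
Each row label in the listing pairs a nucleotide position with the residue number of the codon that starts there (for example 2071/691). The following is a minimal Python sketch of that bookkeeping, assuming 1-based numbering with the reading frame starting at nucleotide 1 and the standard genetic code. The helper names `residue_number` and `translate_row` are illustrative and do not come from the original figure. Stop codons are reported with the classical names, matching the OCH shown for TAA in the listing.

```python
# Sketch only: helper names are illustrative, not part of the original figure.
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order ('*' marks a stop).
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
ONE_LETTER = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "ala", "R": "arg", "N": "asn", "D": "asp", "C": "cys",
    "Q": "gln", "E": "glu", "G": "gly", "H": "his", "I": "ile",
    "L": "leu", "K": "lys", "M": "met", "F": "phe", "P": "pro",
    "S": "ser", "T": "thr", "W": "trp", "Y": "tyr", "V": "val",
}
# Classical stop codon names: ochre, amber, opal.
STOP_NAMES = {"TAA": "OCH", "TAG": "AMB", "TGA": "OPA"}


def residue_number(nt_pos):
    """Residue number of the codon containing 1-based nucleotide nt_pos."""
    return (nt_pos - 1) // 3 + 1


def translate_row(row):
    """Translate a space-separated row of codons into three-letter codes."""
    names = []
    for codon in row.upper().split():
        if codon in STOP_NAMES:
            names.append(STOP_NAMES[codon])
        else:
            names.append(THREE_LETTER[ONE_LETTER[codon]])
    return " ".join(names)


if __name__ == "__main__":
    print(residue_number(2071))
    print(translate_row("AAA TGG GTA GAT AAT CAC AAA GGT GTT TTT"))
```

Running a printed row through `translate_row` and comparing the result with the line beneath it is a quick way to spot character-level errors in either line.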
A. Allelic Variations: single nucleotide changes (polymorphisms) between hCLASP-3 
cDNAs and/or genomic sequences 

Isoform   Difference     Nucleotide position   Consequence 
1         polymorphism   318                   A to G; missense 
2         polymorphism   323                   A to G; Glu to Gly 
3         polymorphism   2187                  T to C; missense 
4         polymorphism   3165                  T to G; Asp to Glu 
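
The consequence column can be checked against the sequence listing by substituting the stated base within its codon. The following is a minimal sketch, assuming the table's nucleotide positions use the same numbering as the listing and the standard genetic code; `substitution_effect` is a hypothetical helper name. In the listing, nucleotides 3163-3165 read GAT (asp, residue 1055), so a T to G change at 3165 gives GAG (glu), in line with the entry for isoform 4.

```python
# Sketch only: substitution_effect is an illustrative helper, not from the source.
from itertools import product

BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def substitution_effect(codon, nt_pos, new_base):
    """Classify a single-base change at 1-based position nt_pos.

    codon is the wild-type codon that contains nt_pos. Returns
    (residue number, old residue, new residue, mutant codon, kind),
    with residues as one-letter codes and '*' for a stop.
    """
    offset = (nt_pos - 1) % 3            # 0, 1 or 2 within the codon
    mutant = codon[:offset] + new_base + codon[offset + 1:]
    old, new = CODE[codon], CODE[mutant]
    if new == old:
        kind = "silent"
    elif new == "*":
        kind = "nonsense"
    elif old == "*":
        kind = "readthrough"
    else:
        kind = "missense"
    return (nt_pos - 1) // 3 + 1, old, new, mutant, kind


if __name__ == "__main__":
    # Codon GAT at 3163-3165 is taken from the sequence listing.
    print(substitution_effect("GAT", 3165, "G"))
```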










B. Alternative Exon usage 

Isoform   Difference      Nucleotide position    Consequence 
1         Exon deletion   2768-2860, inclusive   In frame deletion - 33 amino acids removed 











These differences may be found separately or together in various combinations in the 
different human CLASP-3 isoforms. 
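
Whether a deletion preserves the reading frame follows from its length: an inclusive range of nucleotides is in frame when its length is a multiple of three, and the number of residues removed is that length divided by three. The following is a minimal sketch of that arithmetic; `deletion_summary` is a hypothetical helper name, and it assumes 1-based inclusive coordinates as stated in the table.

```python
# Sketch only: deletion_summary is an illustrative helper, not from the source.

def deletion_summary(first_nt, last_nt):
    """Summarise a deletion given 1-based inclusive nucleotide coordinates."""
    length = last_nt - first_nt + 1
    in_frame = length % 3 == 0
    return {
        "length_nt": length,
        "in_frame": in_frame,
        # Only meaningful when the reading frame is preserved.
        "residues_removed": length // 3 if in_frame else None,
    }


if __name__ == "__main__":
    print(deletion_summary(2768, 2860))
```

Calling `deletion_summary(2768, 2860)` applies this to the range listed in table B, so the result can be compared with the amino acid count given there.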
Exon 60270 - 20370 

rr^rw a r^C'C^CC^.C.C^.C. GTC.GC.CGTCGCCGCAGCAGCCAT GGCCGAGCGC 
r.GCGCCTTCaCCCAGAAGATCAGCAGGTAAATATCCGGCGTGGGGCGC 



Exon 85360-85510 

r;TTTTTr^TTT(TTTrATTGTA GAACGGTGGCAGCCGAAGTTAGG AAGCAGATC 
TCCGGACAATATAGTGGTTCTCCCCAACTGCTCAAAAACCTTA ATATTGTTGG 
rAATATATCCCATCACACCACAGTA AGTAACGTATTCAAAATATA 



Exon 94500 - 94720 

tttt A TCCC. A A (TTTTTT A f! A A AGG TGCCCCTTACCG A AGC AGTAGATCCAGTG 
GATTTGGAAGATTACCTCATTACTCATCCTTTGGCTGTGGATTC TGGGCCTTT 
ACGGGATTTGATTGAATTTCCTCCAGATGATATTGAAGTTGTT TATAGTCCTC 
GGGACTGCAGAACTCTTGTTTCAGCTGTACCTGAAGAAAGG TAAGGAGACAT 

TGACTTATT 



Exon 94870 - 94980 

TATTTTrrTTTTTAAAATA GTGAAATGGATCCACATGTTAGAGA CTGTATAAG 
AAGTTATAGAGAAGACTGGGCAATTGTCATCAGAAAG TAAGTTATATGTTTA 

TTACAA 

Exon 100110- 100290 

atttattta Af . ' I ' t ' l ' l ' l 1 KM I I A ATA GATATCATAAATTGGGAACAGGATTTAA 
TCCCAATACATTAGATAAACAGAAAGA AAGGCAAAAAGGTTTGCCAAAACA 
A GTTTTTGA ATCTG ATGAAGCTCC AGATGGC AAC AG CTACC AGG ATGATC AA 
GTAATACTTTTATTCTTAAATAA 



Exon 100340- 100600 

ATATTTA ATnTTTTflT!ATGAr.AG GATGACCTTAAAAGACGTTCAAT GTCAATA 

GATGATACCCCAAGGGGTAGCTGGGCCTGTAGTATCTTTGACTTG AAAAATT 

CACTTCCTGATGCTTrGCTTCCCAATTTACTTGATCG AACTCCAAATGAAGAA 

ATAGACCGTCAGAATGATGACCAAAGGAAATCAAA CCGTCACAAAGAACTTT 

TTGCTTTGCATCCATCACCAGATGAGGT ATAGATGTTTGCATATAAAGAA 
Exon 100880-101020 



TTTTr T r T TrTTTGCTTTTCAATTTGTAGG .^AGAACC.\ATAGAACGGCTTAGTGTT 
CrTGATATACCCAAAGAACATTTTGGTCAAAGACTTCTTGTAAAATGCTTATC 
ACTCAAGTG AGTATTTATTTCTTTTACTTACAACT 

Exon 112010-112120 

TTTTTr.TTr.ATAAAGG TTTGAAATTGAAATTGAACCCATTTTTGCA 
CTTTATATGATGTCAAGGAAAAGAAAAAGG TAAGATTATATAATTTGACCAT 

AGTTAT 

Exon 113680-113880 

A AfrTTTA ArATACTAATATTTTTTA GATTTCAGAAAACTTTTATm 
ATTCTGAGCAGATGAAAGGGTTGTTACGTCCACATGTACCACCTGCTGCCATT 
ACTACCCTGOCAAGATCAGCAATTTTTTCTATCACITATCCTTCCCAAGATGT 
TTTTCTTGTAATAAAGGTGAGAATAATGTTAAATATATTTG 



Exon 115020-115160 

TTA ATrTTAACTTTTTTTGC^CTTTGACA GCTAGAAAAAGTCCTACAGCAAGGA 
GACATTGGAGAGTGTGCAGAACCATATATGATTTTCAAAGAAGCAGATGCCA 
CCAAGGTAGAATGTTATG<rrTCTCATTTCCGCCAC 



Exon 117200- 117410 

ATPtT AT A A AGTTCTGTTTTGC A GAATAAAGAAAAACTGGAGAAACTGAAG AG 
TCAAGCAGATCAGTTTTGCCAAAGACTTGGGAAATATCGCATGCCTTTTGCTT 
nGACTGCAATCCATTTAATGAATATTGTTAGCAGTG CTGGGAGTTTGGAAAG 
AOATTCTACAGAAGTAGAAATCAGTACTGGAGG TAAGAGTGTTTCATACAAA 

AC 



Exon 123200 - 123396 

A A A A Tft A A I • I ■ I ' n ' 11 ■ II 1 A ATTCTTTTGTA G A ACGAAAAGGGTCTTGGTC AGA 
GAGGAGGAATTCTAGTArrGTTGGCAGACGATCACTTGAAAGGACAACAAGT 
GGAGATGATGCTTGTAACTTGACGAGCTTTCGACCAGCTACTCTCACAGTGAC 
AAAl'lll' fTTA ACtC AGGT ATTGTTCTGTC ATGT AGGAATTTT 
(Next part of CLASP, starting 

GAAGGAGACCGCTTAAGTGATGAAGATCTCTACAAATTCCTTGCTGATATGA 
GAAGGCCATCTTCTGTCTTACGGCGACTAAGACCTATTACAG) 

Exon 5560-5710 

r.TTTTTCCTCTATTATTGAAATCAGGAAGGAGACCGCTTAAGTGATGAAGATC 
TCTACAAATTCCTTGCTGATATGAGAAGGCCATCTTCTGTCTTACGGCGACTA 
AG A CCT ATT ACAGG TATTTAAAAATTTTGAGTAGAAATGGTTGCA 

Exon 6680-6900 

TTAr.ATTGTTTTTTAATATATAATTTGCAGCTCAGCTCAAGATAGACATTTCTC 

CCGCACCTGAAAATCCCCATTATTGCCTAACTCCGGAGCTGCTTCAAGTGAA 

nCTTTACCCTGACAGTAGAGTTAGACCrACCAGAGAAATCTrAGAGTTTCCCG 

CAAGGGATGTTTATGTrCCAAACACTACTTACAGG TAAGAGATTTrAATTTGG 

AGAATTCTG 

Exon 38920 - 39075 

fiTATTTACTATCATCTATACTGrTGCTTTCACA GAAATCTTCTCTACATATACC 
CTCAGAGTCTTAATrTTGCCAATCGTCAAGGTTCTGCTAGAAATATAACAGTG 
AAAGTCCAGTTTATGTATGGAGAGGATCCAAGCAATGCCATGCCGGTAA 
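
An exon entry can be checked against its genomic context by locating the exon sequence and reading the two bases on either side of it; introns conventionally end in AG and begin with GT. The following is a minimal sketch. `exon_flanks` is a hypothetical helper name, the exon string is the one quoted in the "Next part of CLASP" note, and the short flanking strings are taken from the Exon 5560-5710 entry with OCR spacing removed, which is an assumption about how that entry should read.

```python
# Sketch only: exon_flanks is an illustrative helper, not from the source.

# Exon sequence as quoted in the "Next part of CLASP" note.
EXON = ("GAAGGAGACCGCTTAAGTGATGAAGATCTCTACAAATTCCTTGCTGATATGA"
        "GAAGGCCATCTTCTGTCTTACGGCGACTAAGACCTATTACAG")

# Flanking bases from the Exon 5560-5710 entry (assumed reading, spaces removed).
UPSTREAM = "TATTGAAATCAG"
DOWNSTREAM = "GTATTTAAAAATTTTGAG"


def exon_flanks(genomic, exon):
    """Return the two bases before and after exon in genomic, or None if absent."""
    start = genomic.find(exon)
    if start < 0:
        return None
    end = start + len(exon)
    return genomic[max(0, start - 2):start], genomic[end:end + 2]


if __name__ == "__main__":
    print(exon_flanks(UPSTREAM + EXON + DOWNSTREAM, EXON))
```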
GTAAGCTGCAGAGTACATGCAAAGTCCTTTCAGACACTTTGGCAACTAGTGAGTCATGT 
ATACAAAGAACTATTCAAAGCAAAATGTGGCCAGTTCTCTAAGAGAGTAGTGAAAAGAT 
GTTTGGAGAACAGAGATAAAGGCATGCAACATCTGGCTGGGAGTGTGAGAGCAGACAAC 
CAATTCAGGGACTATCAAACACCCAGGACAAGCTTAAGTACAGGATGAATAATGACAGT 
AACAGTGATAATAATAACAACAGCTGACACACATCGTACTTACCATGAACCAGGCCCTG 
TTCTAAGAACTTTATATTAATCAATTCAATAAATCTTCACAACACCCTTTCAAGTAGAT 
ATAATTATTTTCCTCCTTTTAGAGATGAAGTTAAGTAGCTTGCCTAAGGTTACAGAGTA 
AGAAAGTGGCAAAATAAGTCTGGGTTCCTATCACAGAGGGTCTTAAAAGCCAGGTACAG 
TGATAGCCTTGACTTACAGTCTGTGAGTACCTGGTTCATCTTTCCCAATGGACTCAAAC 
ATTCTGAGAGCAGGTCTGTGTCTAGTTCACCTTCGTGTTTTTTAGAACACCTAGGTGGG 
CTCTATACTTAGGTAGAAGCCAJATATCTGTCCTCTTTGAATAGACTATCTTTTCCTTT 
AGTAAACAAGCATCTGAAAACAGTGTGTTACTGGAGGGCGCCTTGGCAAGAGCCTGTGG 
TACTAGCTACTCCTCCAGAGGCTGAGGCAAGAGGATTGCTTGAGCCTAGGAGTTTTAGG 
CTGCAGTGCCCTATGCATTCCATGCTTGAGCAACACAGGGGGATTCTGTCTAAAATAAA 
TAAATCAATAAATAAAAGCAGTGTGTTACTGCAATGAGCATTGTTATATAAATACACAG 
TCTTCCACAGAAGTTACAGCACTAGACAGGAAGTTATGGATCTGACTAATTGAGGCTTT 
CACCTCTGACTCAGTGTGTGACTCTGGCAAGTGACCTAATGGCTCCAAGCTACAGATTT 
TCCAATGGTAAAAGAGAGGAAAACAGGTATCCAACATATTCCACCAGGACATACTGAGA 
ATATAATAATAACAATAAATAAJlTACACATTTTAAGTTCTTGCAATACTTAAGAGGAAA 
GGTACTACAAGAAACCAAGATATTTTTCTTAGTGTCACCATAATTTCAGTATCAACAGA 
TATGCATTAGGAGCCAAATATTCTAGACATTATGGGATTACAACAGAACAGATGAAAAC 
AAATCTCTGCTAGTCTCTGTTTTATAGGCTATTATGGACAAGGTCAGTTTCAGTGGTCA 
ACACAGAGCTTATTTAACAGAACATCAAAATGGGGGTGAACGTTAGAGCTCACCTGGAT 
TTTATATATATATATATATATATATATATATATATATATATATGTATGAGAGACAGAGA 
GAGATCCAAAAGGGGTTAATAATACGGAAACAACTGTCCTCAAAGAAAAGAACCGGCTC 
TGGGTTTGAGTCTCAGTGGAAGAGTTTACCATCTGTGCGACCTGGGGCCAGTCACTTAG 
CTTTTCCAAGCCAATTTCCTTCCCTCTAAAATAGTGATAGTAACGATCTACTTCAAAGC 
GCTCGTGCTTGAAAAACTTTAATCCAGTGGCTGGCTAAGCACCAGAAGTCAACTATTAC 
TATTGTTATCGCTGCAGGAGCAJiGGTATTTTCTGGCTCTTTCTCTATAAGAAAAACCAC 
TGTTTCTCCAGTTAGGCTGCATTCATGAGGCCGAAGCAGAAACCCAAGTGCTTTAAAAA 
GCTCGGAGACCGGTGCTGCAGAAACATGAACCCAGTCATCCAGGGCTTTGGTTAAAGCA 
CAGCCCCCTCGGCAAAGCCCACCTTTCATTTCCTTCCCTTCCATTTCGGGAAAGCCAAG 
CTCCCCCTACGCGTTCGGTTATCTTATTTTCTTGCCTCGCCAGGTCGCTGGCCTGTCTC 
TGGCTTCTCTCCGCGACCTCTCGGTCGTGCAGGCTCCGCGGCAGCGAAGCGGCTGGGGC 
TGGCGCCTCGCTTCCTCAGCGCTCCATTCTTCCCCTCGGCTCCCGCCGGCCGCAGCCGC 
CTTCCGCAGCCGGGGTTCCCGCCGGGATTGACGCGCTGGGGGAGGAGCGGTTTCTCGTT 
GCGCGCCTCTAAGGAACATTACGGCAGGGCTCGTTCCTGGCTCCGGCCGCCAGCCCCAG 
CCTCCCAGGTCCGGAGCCCGGACTGGCGGAGGCCGCGAGGGAGGGAGCACGAGCGAGGA 
GGCACGCGCCCGCCGGTCCGCACCCGGCCCGCCGCAGCCGCCGCCGCCGTCGCCGTCGC 
CGCAGCAGCCATGGCCGAGCGCCGCGCCTTCGCCCAGAAGATCAGCAG 
hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



MFPMEDISISVIGRQRRTVQ 20 

MTHLNSLDVQLAQELG 16 

MAERRAFAQKISRTVAAEVRKQISGQYSGSPQLLKNLNIVG 41 

MLLFPYDDFQTAILRRQGRYICS 23 

MAASERRAFAHKINRTVAAEVRKQVSRERSGSPHSSRRCSSSL 43 

MSFRGKVFKREPSEFWKKRRTVRRVIQEEFHRFSSQEKPRLLEPLDYETVIEELEKTYRN 60 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



S TVPEDAEKRAQSLFVKEC I KTYS TDWHWNYK 53 

DFT 19 

N ISHHTTVPLTEAVDPVDLEDYLITHPLAVDSGPLRDLIEFP 83 

TVPAKAEEEAQS LFVTEC I KT YNSDWHLVNYK 55 

G VPLTEWEPLDFEDVLLSRPPDAEPGPLRDLVEFP 79 

DPLQDLLFFPSDDFSM.TVSWDIRTLYSTVPEDAEHKAEKLLVKEACKFYSSQW}nA^K 120 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



YEDFSGDFRMLPCKSLRPEKIPNHVFEIDEDCEKDED 

DDDLDWFT PKECRTLQP- S LPEEGVELDPHVR 

PDDIEWYSPRDCRTLVS-AVPEE-SEMDPHVR 

YEDYSGEFRQLPNKWKLDKLPVHVYEVDEEVDKDED 

ADDLELLLQPRECRTTEP-GI PKD-EKLDAQVR 

YEQYSGDIRQLPRAEYP^PEKLPSHSFEIDHEDADKDEDTTSHSSSKGGGGAGGTGVFKSG 180 



SSSLCSQKGGVIKQG 105 

DCVQTYIREWLI €3 

DCIRSYTEDWAI 126 

-AASLGSQKGGITKHG 107 
AAVEMYIEDWVI 122 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



WLHKANVNSTIT — VTMKVFKRRYFYLTQLPDGSYILNSYKDEKNSKESK-GCIYLDACI 162 
VNRKNQGSPEIC — GFKKTGSRKDFHKT-LPKQTFESETLECSEPAAQA — GPRHLNVLC 118 
V I RKYHKLGTG F — NPNTLDKQKERQKG-LPKQVFESDEAPDGNS YQDDQDDLKRRSMS I 183 
WLYKGNMNSAIS — VTMRSFKRRFFHLIQLGDGSYNLNFYKDEKISKEPK-GS IFLDSCM 164 
VHRRYQYLSAAY — SP^TDTQRERQKG-LPRQVFEQDASGDERSGPEDSNDSRRGSGSP 179 
WLYKGNFNSTVNNTVTTOSFK?ORYFQLTQLPDNSYIMNFYKDEKISKEPK-GCI FLDSCT 239 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



DWQCPKMRRHAFELKMLDKYSHYLAAETEQEMEEWLITLKKI IQINTDSLVQEKKETVE 222 

DVS GKG P VTAC D FDLRS LQP DKRLENLLQQVS AE D FEKQNEEARRTN RQAE 1 69 

DDTPRGSWACS I FDLKNSLPDALLPNLLDRTPNEEIDRQNDDQRKSN RHKE 234 

GVVQNNKVRRFAFELKl^QDKSS YLIAADSEVEMEEWI TILNKI LQLN FEAAMQEK 219 

EDTPRSSGASS I FDLRNLAADSLLPSLLERAAPEDVDRRNETLRRQH RPPA 230 

GWQb^RLRKYAFELKIWDLTYFVIJU^TESI^IDEWIHTLNRILQISPEGPLQG^ 299 

+ * + • * * * 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



TAQDDETSS QGKAEN IMAS LERSMHPELMKYGRETEQLNKLSRGDGRQNLFS FDSE 278 

LFALYPSVD EEDAVEIRPVPECPKEHLG N -RILVKLLTLKFEIE 2 12 

L FALHPSP D EEEP IERLS VPDI PKEH FG QRLLVKCLSLKFE IE 277 

RNGDSHEDD EQSKLEGSGSGLDSYLPELAKSAREAEIK LKSESRVKLFYLDPD 272 

LLTLYPAPD EDEAVERCSRPEPPREHFG QRILVKCLSLKFEIE 273 

TDLGLDSLDNSVTCECTPEETDSSENNLHADFAKYLTETEDTVKTTRNMERLNLFSLDPD 359 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
VQRLDFS GIEPDIKP-FEEKCNKRFLVNCHDLTFNILGQIGDNAKGPPTNVEPFFI 333 

IEPLFAS IALYDVKERKKISENFHCDLNSDQFKGFLRAHTPSVAASSQARSAVFSV 268 

I EP I FAS LAL YDVKEKKK I SENFY FDLNSEQMKGLLRPHVP PAAI TTLARSAI FS I 333 

AQKLDFS SAEPEVKS-FEEKFGKRILVKCNDLSFNLQCCVAENEEGPTTNVEPFFV 327 

IEPIFGI LALYDVREKKKI SENFYFDLNSDSMKGLLRAHGTHPAI STLARSAI FSV 329 

IDTLKLQKKDLLEPESVIKPFEEKAAKRIMIICKALNSNLQGCVTENENDPITNIEPFFV 419 
hCLASP4 
hCLASP5 
hCLASP3 
hCLASP7 
hCLASP1 



NXJVLFDVKNNCKISADFHVDIJ^PPSVREMLWGSSTQIJ^DGSP KGSSPESYIHGIAE 390 

TYPSSDIYLWKIEKVLQQGD IGDCAEPYTVIKESDG GKSKE-KIEKLKL 317 

T Y P S QDVFLV I KLEKVLQQGD 1 GECAE PYM I FKEADA TKNKE - KLEKLKS 382 

TLSL FDI KYNRKI SADFHVDLNHFS VRQMLATTS PALMNGS GQSPSVLKGILHE 381 

TYPS PDI FLVIKLEKVLQQGD ISECCEPYMVLKEVDT AKNKE-KLEKLRL 37 8 

SVALYDLRDSRKISADFHVDLNHAAVRQMLLGASVALENGNIDTITPRQSEEPHIKGLPE 479 
^ • ^ • • • * 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



SQLRY I QQG I FSVTNPHPE I FLVAR I EKVLQGNI THCAEP Y I KNSDPVKTAQKVHRTAKQ 450 

QAE S FCQR LGKYRMP FAWAP IShSS FFNVS TLEREVT DVDSWGRS PVGERRT LA 372 

QADQFCQR LGKYRMP FAWTAIHLMNIVSSAGSLERDSTEVEI STGERKGSWSERR 437 

AAMQYPKQGI FSVTCPHPDI FLVARIEKVLQGSITHCAEPYMKSSDSSKVAQKVLKNAKQ 441 

AAEQFCTR LGRY RMP FAW TAVHLAN I VS S AGQLDRDS D SEGERRPAWTDRR 429 

EWLKFPKQAVFSVSNPHSEIVLVAKIEKVLMGNIASGAEPYIKNPDSNKYAQKILKSNRQ 539 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



VCS RLGQYRMP FAWAARP I FKDTQGSLDLDGF FS PLYKQDS SKLS SEDI LKLLSEYKKPE 

QS RRLSERALSLEENGVG SNFKTS TLS VSS FFKQEGDRLS DEDLFKFLADYKRS S 

NSSIVGRRSLERTTSGDBACNLTSFR-PATL1 VTNFFKQEGDRLSDEDLYKFLADMRRPS 
ACQRLGQYRMP FAWAARTL FKDAS GNLDKNAF FS AI YRQDSNKLSNDDMLKLLADFRKPE 

RRGPQ- - DRAS S GDDACS FS G FR- PATL1 VTNFFKQEAERLSDEDLFKFLADMRRPS 

FCSKLGKYRRAFAWAVfLSVFKDNQGNVDRDSFFSPLFRQESSKISTEDLVKLVSDYRRAD 



* « 



* • • « • 



— KT KLQ 1 1 PGQLN I TVECVPVDLSNC I TS S YVPLKPFE- KNCQNI TVEVEE FVPEMTKY 
S LQRRVKS I PGLLRLE I S TAPE 1 1 NCCLTPEMLPVKP FP- ENRTRPHKE I LE FP — TRE V 
SVLRRLRPITAQLKIDISPAFENPHYCLTPELLQVKLYP-DSRVRPTREILEFP — ARDV 
K-MAKLPVILGNLDI T I DNVSS DFPNYVNSSYI PTKQFETCSKTPITFEVEEFVPCI PKH 
SLLRRLRPVTAQLKI D I S PAPENPHFCLSPELLHI KPYP-DPRGRPTKE I LEFP — AREV 
R- 1 S KMQT I PGS LD I AVDNVPLEHPNCVTS S FI PVKP FNMMAQTE PT VEVEE FVYDS TKY 



510 
427 
496 
501 
483 
599 



567 
484 
553 
560 
540 
658 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 

hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



CY P FT I YKNHLYVY PLQLKYDSQKT FAKARN IAVCVE FRDS DES DAS ALKCI YGKF \GSV 627 
YVPHTVYRNLLYVYPQRLNFVN — KLASARNITIKIQFMCG-EDASNAMPVIFGKS 3GPE 541 
YV PNTTYRNLLYI YPQSLNFAN — RQGSARN I TVKVQFMYG-EDPSNAMPVI FGKS SCSE 610 
TQPYTIYTNHLYVYPKYLKYDSQKSFAKARNIAICIEFKDSDEEDSQPLKCIYGRP 3GPV 620 
YA PHT S YRNLL YVYPHS LN FS S — RQGSVRNLAVRVQYMTG-EDPSQALPVI FGKS SCSE 597 
CBPYRVYKNQIYIYPKIiLKYDSQKCFNKARNITVCIEFKNSDEESAKPLKCIYGKFfeGPL 718 



** 



:iYGKFE< 

* : * ; A 



FTTC ayawshhnqnpe 



E FYDE I KI ELP I HLHQKHHLLFT FYHVS Cd I NTKGTTKKQD'tfvE 687 

FLQ3VYTAVT YHNKSPDFYEEVKI KLPAKLTVNHHLLFT FYH I SC( Q KQGAS VE 595 

FSKI AYTAWYHNRS P D FHEE I KVKLPAT LTDHHHLL FT FYHVS C< Q KQNTI LE 664 

FTR< AFAAVLHHHQNPEFYDEIKIELPTQLHEKHHLLLTFFHVSCI NS SKGS TKKRD\ VE 680 

FTRi AFTPWYHNKS PE FYEE FKLHLPACVTENHHLLFT FYHVS CC P RPGTJ LE 651 

FTSJ AYTAVLHHSQNPDFSDEVKIELPTQLHEKHHILFSFYHVTCI INAKANAKKKEJ LE 778 



* • • • • * 



TPVGFAWVPLLKDGRI ITFEQQLPVS ANLPPGYLNLNDAESRRQCNVE IKWVDGAKPLLK 
TLLGYS WLPI LLNERLQTGS YCLPVA LEKLPPNYSMHSAEKVPLQNPF IKWAEGHKGVFN 

TPVGYTWIPMLQNGRLKTGQFCLPVS LEKPPQAYSVLSPEVP LPG yQCWVDNHKGVFN 

TQVGYSWLPLLKDGRVVTSEQHIPVS ?^NLPSGHLGYQELGMGRHYGPEIKWVDGGKPLLK 

TPVGFTWIPLLQHGRLRTGPFCLPVS VDQPPPSYSVLTPDVA LPG tfRWVDGHKGVFS 

TSVGYAWLPLMKHDQIASQEYNIPIA TSLPPNYLSFQDSASGKHGGSE IKWVDGGKPLFK 



747 
655 
721 
740 
708 
838 



* • • 
hCLASP5 

hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 

hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 

hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 





VEWAVSS IHTQDPYLDKFFALVNALDEH 
I S THLVS TVYTQDQHLHNFFQYCQKTES - 



* + 



FKSHLEST I YTQDLHVKKFFHHCQL I QS GS KEVPGEL I KYLKCLHAM 794 

I EVQAVS S VHTQDNHLEKFFTLCHS LES^VT FP I RVLDQK I S EMALEHELKLS I ICLNS S 715 

L FPVR I GDMR IMENNLENE LKS S I S ALNS S 7 80 

GAQALGNELVKYLKS LHAM 787 

VELTAVSSVHPQDPYLDKFFTLVHVLEEfe-AFPFRLK 767 
VSTFWSTWTQDPHVNAFFQECQKREKD MSQSPTSNFIRSCKNLLNVE 887 



KIHAIM 



IMSFLPIILNQLFJK- 



EIQVMIQFLPVILMQLFR VLTNMTH EDDVP 824 

RLEPLVLFLHLVLDKLEQLSVQPMVIAGQTANFSQFAFESWAIANSLHNSKDLSKDOHG 77 5 
QltPVVRFLHLLLDKL3JLLVIRPPVIAGQIVNLGQASFEAMASIINRLHKNLEGNHDQHG 840 

VLT-RAT QEEVA 816 

SFfcPLVAFSHHVLDKLVjRLVIRPPI ISGQIVNLGRGAFEAMAHWSLVHRSLEAAQDARG 827 

VLVQNE EDEIT 916 



INCTMV-LLHIVSKCHEEGLDS YLRSFIKYS FRPEKP 

RNCLLASYVHYVFRLPEVQRDVPKSGAPTALLDPRSYHTYGRTSAAAVSSKLLQARVMSS 
RNSLLASYIHYVFRLPNTYPNSSSPG-PGGLGGSVHYATMARSAVRPAS 

VNVTRV- I IHWAQCHEEGLES HLRSYVKYA YKAEPY 

HCPQLAAYVHYAFRLPGTEPSLPDGAPP VTVQAATLARGSGRPASLYLARSKS I S S 

TTVTRV- LPD IVAKCHEEQLDH SVQSYIKFV FKTRAC 



860 
83S 
899 
852 
883 
952 



SAPQAQLIH ETLATTMIAILKQS : 883 

SNPDLAGTHSAADEEVKNIMSSKIADRNCSRMSYYCSGSSDAPSSPA '882 

SNPDISGTPTSPDDEV1KSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAESTQAMDRSC 959 

VASEYKTVH EELTKSMTT I LKPS 875 

SNPDLAVAPGSVDDEVSRI LASKLLHEELA- LQ 915 

KE RPVH EDLAKNVTGLLKSN " 972 



ADFLS INKLLKYS WFFFEI IAKSM 

APRPASKKHFHEELALQ : MWSTGMVKSM 

NRMSSHTETSSFLQTLTGRLPTKKLFHEELALQVfVVCSGSVRESALQQAWFFFE 

ADFLTSNKLLRYS WFFFDVLIKSM 

WVVS S SAVREAI LQHA WFFFQLMVKSM 

DS PTVKHVLKHS WFFFAI I LKSM 

Cadherin Cleavage 



AT YLLEENKI KLf RGQR fPET YHHVLHS LLLAI I PHVT I RYAE I PDE S RNVN1 S LAS 

AQHVHNMDKRDS B RRTR rSDRTODDITTIVNWTSEIAALLVKPQKENEQAEKMNI SLAF 

VHHLYFNDKLEAJ RKSR "PERFMDD I AALVST I ASD I VSRFQKDTEM VERLN1 SLAF 

AQHLIENSKVKLI RNQR rP AS YHHAAE TWNMLMPH I TQK FGDNPEA SKNAN* SLAV 

RKLR fPGRFLDDI TALVGS VGLE VI TRVHKDVEL AEHLRP SLAF 



ALHLLLGQRLDTE 

AQHL I DTNKIQLJjRPQRlTPES YQNELDNLVMVLSDHVI WKYKDALEE TERATfc 



SVAR 
* • * 



FLKRCLTLMDRGFIF SLINDYISGFSPKDP 



FLYDLLS LKDRG FVF S1LIRHYCSQLSAKLSNL F 



-F VLAEYKFEFLQT ICNHEHYI PLNL 
TL I SMRLEFLRI LCSHEHYLNLNL 
FLNDLLSVMDRGFVilSLIKSCYKQVSSKLYSLPNPS VLVS LRLDFLRI ICSHEHYVTLNL 

FIKRCFTFMDRGFVF SQINNYISCFAPGDP h TLFEYKFE FLR WCNHEHY I P LNL 

FLS DLLS LVDRG FVF SLVRAHYKQVATRLQSS PNP2 ALLTLRME FTRI LCS HEHYVTLNL 
FLKRCFT FMDRGCVF KMVNNYISMFSSGDL K TLCQYKFDFLQEVCQHEHFI PLCL 



« + * + 



907 

910 

1019 

899 

942 

995 



964 

970 

1076 

956 

999 

1052 



1019 
1027 
1136 
1011 
1059 
1107 
Cadherin EC motif 

PMAFAKPKLQR VQDS - -NLEYS LSDEYCKHH FLVGljLLRE T S I 

FFMNADTAPTSP — CPSISSQNSSSCSSFQDQKIASMFD1»TSEYRQQHFLTGI LFTELAA 
PCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIANMFELSVPFRQQHYLAGI VLTE LAV 

PMPFGKGRIQR YQDL — QLDYSLTDEFCRNHFLVGI LLRE VGT 

PCCPLSPPASPSPSVSSTTSQSSTFSSQAPDPKVTSMFELSGPFRQQHFLAG^LLT^AL 

PIRSANIPDPLTP SES T QELHAS DMPE YS VTNE FCRKH FL I G 



LLREVGF 



1060 
1085 
1196 
1052 
1119 
1157 



ALQDN YE IRYTAI 



ALQEFR : 

ALEPEAEGAFltLHKKAI 



ALQEDQ 



SVIKNLLIKHAFDTRYQHKNQQAKIAQLYLPFVGLLLENlbRL 1116 

ILDAL ? — 1143 



ALDAEGEGISFVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAALYLPLVGI 

ILDPEAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKA^ 1254 

EJVRLIAI SVLKNLLIKHS FDDRYASRSHQARIATLYLPLFGLLIENV »I 1108 

SAVHSLLCGHDTDPRYAEATVKARVAELYLPLLSIARDTL ?-- 1177 

dvRHIJU^WLKNLMAKHSFDDRYREPRKQAQIASLYMPL 1213 



2j 



AGRDTLYSCA AMPN-S ASRDEFPCGFTSPANRGSLSTDKDTAYGS 1160 

QL CDFTVADTRRYRTSGSD 1162 

QLY DFTETHNQRGRPICIATDD — 1276 

NVRDVS P FPVNAGMTVKDESLALPA- VN PLVT PQKGS TLDNS LHKDLLGAI SG IAS PYTT 1167 

RLH : DFAEGPGQRSRLASMLDSDTE 1201 

YLKDLYPFTVNTSNQGSMDLSTNGGFQSQTAIKHANSVDTSFSKDVLNSIAAFSSIAIS 1273 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



FQ-NGHGIKREDSRGSLI PEGATGFPDQGNTGEN TRQSS TRS S VS QYNRLDQYE 1213 

EEQEGAGAI NQNVALAI AGNNFNLKT SGIVLSS LP YKQYNMLNADT 1208 

YESESGSMISQTVAMAIAGTSVPQLTR PGSFLLTSTSGRQHTTFSAES 1324 

STPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSVVRCDKLDQSE 1227 

GEGDIAGT INPS VAMAI AGGPLAPGSR AS I SQGPPTASRAGCALSAES 1249 

TVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPLALIGSTLRFDRLDQAE 1327 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



RSLLMCYLYIVKMISEDTLLTYWNKVSPQELINILILLEVCLFHFRYMGKRNIAR /HDA 1273 
tIrNLM I C FLW IMKNADQS L I RKW I ADLP S TQLNR I LDLL F I CVLC FE YKGKQS SDK /STQ 1268 
RS LLI CLLWVLKNADETVLQKW FTDLSVLQLNRLLDLL YLCVSC FEYKGKKVFER 4NSL 1384 
3kS LLMC FLY I LKSMS DDAL FT YWNKAS TSELMDFFT I SEVCLHQ FQYMGKRY I AR tfQEG 1287 
RTL1ACVLWVLKNTEPALLQRWATDLTLPQLGRLLDLLYLCIAAFEYKGKKAFER INSL 1309 
RS LLMC FLH IMKT I SYETLIAYWQRAPS PEVS DFFSI LDVCLQNFRYLGKR NI IRj OAA 1387 



* * 



— KSQTMPALRNRS GVMQARLQHLS S LES S 1311 

-LEEALLRGEGARGEMMRRRAPGNDRFPGLNEN 1311 

- LEEAI LGS I GARQEMVRRSRGQLERS PSGSAFGS Q 1430 

QTLPVSRNRTGMMHARLQQLGSLDNS 1323 

• - LEEAI LGT I GARQEMVRRSRERS P FGNPEN 1350 



WLSKHFGIDR 

VLQKSRDVKAR 

TFKKSKDMRAK 

LGPIVHDRKS 

TFKKSLDMKAR 

AFKFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHKQHRSQTLPI IRGKN 1442 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



— LRWKKEQTHWRQANEKLDKTK? ELDQEALISGNLATEAHLIILDMQENI [QASS-ALD 

enlrwrkdmthwrqntekldksr; EIEHEALIDGNLATEANLIILDTLEIV /QTVS-VTE 

LTFNHSYGHSDJ DVLHQSLLEANIATEVCLTALDTLSLF rLAFKNQLL 

— VRWRKSVTHWKQTS DRVDKTKI EMEHEALVEGNIATEASLVVLDTLEI I /QTVM-LSE 

— alsnpkllqmldntmtsnsne: DIVHHVDTEANIATEGCLTILDLVSLF TQTHQRQLQ 



★ *** 
NNDGHNPIHKKVFDIHIAFLKNGQSEVSLKHVFASLRAFISK^ 1419 
CKDS — LLGGVLRVLVNS LNC DQS TTY LTHC FAT LRAL I AKFGDLL FEEEVEQC FDLCH 1425 

SKES ILGGVIJCVLLHSMACNQSAVYLQHCFATQRALVSKFPELLFEEETEQCADLCL 1546 

ADHGHN P LMKKVFDVYLC FLQKHQS E T ALKNV FT ALRS L I YKFP S T FYEGRADMCAALC Y 1431 

ARES VLGAVLKVVLYSLGSAQSALFLQHGLATQRALVSKFPELLFEEDTELCADLCL 14 64 

QCDCQNS LMKRG FDTYML FFQVNQS ATALKHVFAS LRL FVCKFP S AFFQGPADLCG S FC Y 1560 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP1 



EVLKCCTSKISSTRNEA3ALLYLLMRNNFEYTKRKTFLRTHLQI 1 1 AVS QL I ADVALS GG 1479 
QVLHHCS S SMDVTRSQACATLYLLMR — FS FGAT S N FARVKMQVTMS LAS LVGRAP D FNE 1483 
RLLRHCSSS IGT IRSHPSASLYLLMR — QN FE I GNN FARVKMQVPMS LSS LVGTS QN FNE 1604 
E I LKCCNS KLS S I RTEAS QLL YFLMRNN FDYTGKKS FVRTHLQVI I S VS QL IADWG I GE 1491 
RLLRHCGSRISTIRTHASASLYLLMR — QNFEIGHNFARVKMQVTMSLSSLVGTTQNFSE 1522 
EVLKCCNHRSRS T QTEASALLYL FMRKNFE FNKQKS I VRSHLQL I KAVSQL I ADAG - IGG 1619 



• • * 
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hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4
hCLASP5
hCLASP3
hCLASP2
hCLASP7

hCLASP1



hCLASP4
hCLASP5
hCLASP3

hCLASP2
hCLASP7

hCLASP1



hCLASP4 
hCLASP5
hCLASP3 
hCLASP2 
hCLASP7 
hCLASP4
hCLASP5
hCLASP3
hCLASP2
hCLASP7
hCLASP1



SRFQESLFI INNFANSDRPMKATAPPAEVKDLTKRIRTVLMATAQWKEHEKDPEML I DLQ 
EHLRRSLRT I LAY SEEDTAMQ^iTP FPTQVEELLCNLNS I LYDTVKMREFQEDPEMLMDLM 
EFLRRSLKT I LTYAEEDLELRETT FPDQVQDLVFNLHMI LSDTVKMKEHQEDPEMLIDLM 
TRFQQSLS I INNCANSDIU*IKHTSFSSDVKDLTKRIRTVLMATAQMKEHENDP 
EHLRRSLKT I LT YAEEDMGLRDS T FAEQVQDLMETCLHMI LTDTVKMKEHQEDPEMLI DLM 
SRFQHSIAITNNFANGDKQMKNSNFPAEVKDLTKRIRTVLMATA^ 



* * • 
transmembrane



YSLAKSYASTPELRKTWLDSMAKIHVKNGI 



YRIAKSYQASPDLRLTWLQNMAEKHTKKKC YTEAAMCLVHAAALVAEYI SMLEDH 

YRIAKGYQTSPE-RLTWLQNMAGKHSERS* HAEAAQCLVHSAALVAEYI SMLEDR — 



YSLAKSYASTPELRKTWLDSMARIHVKNGI 



• • • 



. + . * * <4r * • ★ + 
FS EAAMC YVHVAAL VAE Fl HRKK



LS EAAMC YVHVTALVAE Y 1 TRKG 



YRIARGYQGSPDLRLTWLQNMAGKHAELGh HAEAAQCMVHAAALVAEY1 ALLEDQ 

YSLANSYASTPELRRTVfLESMAKIHARNGI LSEAAMCYIHIAALIAEYI KRKG YWKVEK I 



1539 
1543 
1664 
1551 
1582 
1679 



1592 
1598 
1718 
1604 
1637 
1739 



L FPNGCS AFKKI TPNI DEE GAMKE DAGMMD 

SYLPVGSVS FQN I S SNVLEESWSEDTLS PDEDGV 

KY LPVGCVT FQN I S SNVLEES AVS DDWS PDEEG I 

VFRQGCTAFRVI T PNI DEEASMMEDVGMQD 

RHLPVGCVS FQN I S SNVLEES AI S DDI LS PDEEG F 

CTASLLSEDTHPCDSNSLLTTPSGGSMFSMGWPAFLSITPNIKEEGAAKEDSGMHD 



ITAM 



VHYSEEVLLELLEQCVDGLWKAERYE 1 1 SE I SKLI VP I YEKRREFEKLTQV YRT1 HG 

C^GQYFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVIPILEAHREFRKLTLT HSKI QR 
CSGKYFTESGLVGLLEQAAASFSMAQiYEAVNEVYKVLIPIHEAN^^ 

VHFNEDVLMELLE*aCADGLWKAERYELIADI YKLI IPIYEKRR 

C S GKH FTELGLVGLLEQAAGY FTMGGL YEAVNEVYKNL I P I LEAHRDYKKLAAV HGKL QE 
TP YNENI LVEQLYMCGEFLWKSERYELIADVNKPI IAVFEKQRDFKKLSDL YYD3 HR 



• * 



* • 
« 



1622 
1633 
1753 
1634 
1672 
1795 



1679 
1693 
1813 
1677 
1732 
1852 



ITAM 



DOCK motif 



DOCK motif 



J YTK3 LEVMHTKKRLLGT FEW AF YGQSFFEEEE GKEYI YKEE KLTGLSE ISLRLVK1 YG 
; FDS 1 VNKDH — KRMFGT YTR\ G F FG-SKFGDLDEQEFV YKEE AI TKLPE I SHRLEAi YG 
4fSK1 VHQSTGWERMFGT YFR\ GE YG-1 KFGDLeJeQE E V YKEE AI TKLAE I SHRLEG3 YG 

J E FFEDEE GKETt I YKEE KLT PLSE I S QRLLK3 YS 

? FTK3 MHQSSGWERVFG1 Y FRV GF YG-2 HFGDLE EQE E V YKEE S I TKLAE I SHRLEE I TT 
£ YLK\ AEWNSEKRLFGP YYR\ AE YGQG FFEEEE GKEK I YKEE KLTGLSE I SQRLLKI YA 



+ • 
ITAM
1739 
1750 
1872 
ITAM



o3FGT£NVKI I QDS DKVNAKELD PP YAH I Q\ IpTVKi 
QC FGAE FVEVI KDS T PVDKTKLDP> KAY I Q I 



ERFGEDWEVIKDSNPVDKCKLDP>KAYIQI T YVEl YFDTYEKKDRITYFDKNYNLRRFM 



DK FG SENVKM I QDS GKVNPKDLDS F YAY I Q\ T HVI 



* * 



ITAM

YFDDKELTERKTEFERNHNISRFV 1799
liFVE^YFDEYEMKDRVTYFEKNFNLRRFtt 1810 

1932 



** . * . * * . 



EP FGDDWEI IKDSYPVDKSKLDSC KAYIQ] T YVEI YFDTYELKDRVTYFDRNYGLRTFL 
DK FGADNVK 1 1 QDSNKVNPKDLDPJ YAY I Q\ T YVT I FFEEKEIEDRKTDFEMHHNINRFV 1972 



g FFDEKELQERKTEFERSHNI RRFM 1770 

1851 



ITAM



DOCK motif 



FEAPYTLSGKKQGCI EEQCKRRT I LTTSNS FP YVKJ RI P INCEQQI* LKPIDGATDEIKD 1859 
YTTPFTLEGRPRGELHEQYRRNTVLTTMHAFP YIKI RISVIQKEEFV LTPIEVAIEDMKK 1870 
YCTPFTLDGRAHGELHEQFKRKT I LTTSHAFP Y IK1 RVNVTHKEE I ! LT P I EVAI EDMQK 1992 
FEMP FTQTGKRQGGVEEQCKRRT I LT AI HC FP YVKf R I PVMYQHHTI LNP I EVAI DEMSK 1830 
FCTPFTPDGRAHGELPEQHKRKTLLSTDHAFP YIK1 RIRVCHREET\ LTPVEVAIEDMQK 1911 
FETPFTLSGKKHGGVAE.QCKRRTILTTSHLFP YVKPRIQVISQSSTi LNP I EVAI DEMSR 2032 
* . * * . • * * * * * * * • * # . • ***** *• • 

Coiled-coil 



* • • 



• • « 

* ♦ • 



KTAELQKLC S S TDVDM I QLQLKLQC WVSVQVNAGPLAYARAFLNDSQASKYPPKKVSELK 1919 
KTLQLAVAINQEPPDAKMLQMVLQC S VGATVNQG P LEVAQV FLAE I PADPKLYRHHNKLR 1930 
KTQELAFATHQDPADPKMLC^VLQC S VGTTVNQGPLEVAQVFLSE I PSDPKLFRHHNKLR 2052 
KVAE LRQLC S S AE VDM I KLQLKLQC SVSVQVNAGPLAYARAFLDDTNTKRYPDNKVKLLK 1890 
KTRELAFATEQDPPDAKMLQMVLQC SVGPTVNQGPLEVAQVFLAEIPEDPKLFRHHNKLR 1971
KVSELNQLCTMEEVDMI SLQLKLQC S VSVKVNAGPMAYARAFLEETNAKKYPDNQVKLLK 2092 

4.4 * **• #*i * ** ** • 



* • + * 



Coiled-coil 



DM FRKFI QACS I ALELNERLIKEDQVEYHEGLKSNFRDMVKELSDI IHEQILQEDTMHSP 1979 
LCFKEFIMRCGI AVEKNKRLITAIX}REYQQELKKNYNKLkENLRP^ I ERKI PEL YKP I FR 1990 

LCFKDFTKRCEI >ALRKNKSLIGPVQKEYQRELGKLSSP 2090 

EVFRQFVEACG(>ALAVNERLIKEDQLEYQEEMKANYREMAKELSEI J4HEQICPLEEKTS- 1949 
LCFKDFCKKCEDALRKNKALIGPDQKEYHRELERNYCRLREALQPI LTQRLPQLMAPTP- 2030
EIFRQFADACG5ALDVNERLIKEDQLEYQEELRSHYKI>ILSELST\^1NEQITGRDDLSKR 2152

* . * . * * * 



** 



PDZ ligand
WMSNTLHVFCAI SGTS SDRGYGS Pifcc&ESl — 
VESQKRDS FHRSS FRKCETQLSQGS 



2008 
2015 



VLPNSLHIFMAISGTPTSTMVHGMTS dSSWl 1980 

— PGLRNS LNRAS FRKADL — 2047 

GVDQTCTRVISKATPALPTVSIS aSAEV4 — 2180 